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Abstract 



Studying high-dimensional Hamiltonian systems with microstructure, it is an important and chal- 
lenging problem to identify reduced macroscopic models that describe some effective dynamics on 
large spatial and temporal scales. This paper concerns the question how reasonable macroscopic 
Lagrangian and Hamiltonian structures can by derived from the microscopic system. 

In the first part we develop a general approach to this problem by considering non-canonical 
Hamiltonian structures on the tangent bundle. This approach can be applied to all Hamiltonian lat- 
tices (or Hamiltonian PDEs) and involves three building blocks: (i) the embedding of the microscopic 
system, (ii) an invertible two-scale transformation that encodes the underlying scaling of space and 
time, (iii) an elementary model reduction that is based on a Principle of Consistent Expansions. 

In the second part we exemplify the reduction approach and derive various reduced PDE models 
for the atomic chain. The reduced equations are either related to long wave-length motion or describe 
the macroscopic modulation of an oscillatory microstructure. 

1 Introduction 

A major topic in the area of multi-scale problems is the derivation of reduced or effective macroscopic 
models for a given microscopic system. A prototype for this problem is the passage from discrete lattice 
systems to continuum models which describe the effective dynamics on much larger spatial and temporal 
scales. In this case, the microscopic dynamics is governed by a high dimensional system of ODEs, whereas 
the macroscopic models are related to the PDEs of continuum mechanics or thermodynamics. 

In the dynamical setting this problem can be stated as follows: Choosing well-ordered microscopic 
initial data in a specified class of functions, one hopes that the solution will stay close to this class of 
functions. We can interpret the class of functions as an approximate invariant manifold and aim to 
derive reduced equations that govern the evolution on this manifold. Moreover, if the original dynamics 
is related to underlying Lagrangian or Hamiltonian structures, the question arises how these structures 
behave under the reduction procedure. This approach is closely related to the theory of modulation 
equations, sec Mic02, GHM06, SU07 for surveys, which describes how an oscillatory microstructure is 
modulated on the macroscopic space-time. 

In mathematically rigorous terms the transition from a microscopic to a macroscopic scale can be described 
by a coarse graining diagram, which involves the scaling parameter e, see Figure[TJ The curve t <— > z E (t) £ 
M £ denotes the solution of the microscopic model, i.e., it depends on the microscopic time t, and takes 
values in the microscopic state space M e . On the other hand, the macroscopic trajectory r i— ► Z(t) € N is 
parametrized by the macroscopic time r, and describes the evolution of the macroscopic state Z(t) € N. 
The two scales in this problem are linked by a suitable two-scale ansatz, which consists of the time scaling 
t = e@t, as well as a scaling transformation % : M e — > N, which in particular encodes the spatial scaling. 
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Figure 1: The coarse graining diagram 



In the best case the diagram commutes, i.e., if the coarse graining z e (t) Z\ert\ holds at time r = 0, 
then it holds true for a finite time interval r G [0, r^J. Any reasonable micro-macro transition must 
provide an effective macroscopic evolution equation for the macroscopic configuration Z(t) S N. We can 
not expect the macroscopic equation to provide exact solutions to the microscopic system, but we can 
hope that it gives rise to approximate solutions that satisfy the microscopic law of motion up to higher 
orders in e. 



In the standard approach of model reduction one inserts a reasonable two-scale ansatz into the microscopic 
law of motion and derives a macroscopic evolution equation by means of formal expansions with respect 
to the scaling parameter e. 

However, this standard approach ignores the underlying Lagrangian and Hamiltonian structures and 
therefore the following questions arise naturally: (i) Are there macroscopic Lagrangian and Hamiltonian 
structures that correspond to the reduced macroscopic equation? (ii) If yes, how can one derive them 
and what is their relation to the microscopic structures? 

The main issue of this paper is to develop a general framework for micro-macro transitions that relies 
on a two-scale reduction of microscopic Lagrangian and Hamiltonian structures. To this end we split our 
approach into three steps, namely embedding, exact two-scale transformation, and reduction, which can 
be studied independently. Our point of view is strongly motivated by the investigation of microscopic 
lattice systems, where the micro-macro transition replaces a high dimensional system of ODEs by a small 
number of macroscopic PDEs. Nevertheless, our approach to Hamiltonian two-scale reduction can also 
be applied to microscopic PDEs, sec i jl.ll below. Note, that for us a Hamiltonian structure consists 
of a Hamiltonian (function) and a (non-canonical) symplectic form. Alternatively one could study the 
reduction of Poisson structures. 



The Hamiltonian two-scale reduction for lattices always involves the scaling of space and time variables. 
There exists a lot of literature concerning solely the coupling of slow and fast time scales in Hamiltonian 
systems with finite dimension, or fixed spatial scales. The arising mathematical problems can be tackled 
by means of averaging and adiabatic invariants, see for instance |Jar93[ |TR99[ |NV05| . Moreover, a lot 
of work has been done to derive efficient schemes for the numerical integration of such systems, compare 
[UJLL061 |HLW02| . and references therein. 

A second class of micro-macro transitions is related to the passage between different spatial scales. 
For instance, in the static case it is a challenging problem to derive elastic energies from atomistic 
lattice models, and to study the macroscopic convergence of microscopic ground states and energies, see 
[FJOOl IBG02p[ IBG02a[ IFTM IBG061 IThe06l ISch06l IBLM06| . Another kind of spatial reduction arises, 
when the microscopic model combines both large and short space scales. Close to our point of view, 
[GKMS9"5] considers the Euler equations for an incompressible fluid under gravity, and studies the limit 
of vanishing height. It can be shown that the underlying Poisson-structure converges to a limit that 
corresponds to the shallow water equation. Moreover, using similar methods the equations for shells and 
plates can be derived from the three-dimensional models of nonlinear elasticity, sec GKM96 . 
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1.1 Motivating examples 

Let us first discuss two simple examples related to microscopic PDEs which highlight the essential features 
that arise in the general setting. Below we will see that microscopic lattices can be treated similarly if 
viewed as embedded into systems with continuous space variable. 

The first example concerns the passage from the Boussinesq equation to the Korteweg-de Vries (KdV) 
equation. Here, the microscopic dynamics is governed by 

Xtt — ^7777 ^£77777777 ~t~ £77 "£7777; (1-1) 

where the unknown function x depends on the microscopic time t and the microscopic space variable 
r] £ R. Notice that x t and x v abbreviate d t x and d v x, respectively. For simplicity we ignore all boundary 
effects, so that the microscopic configuration space is Q = L 2 (W; dr/). One particular macroscopic model 
for (jl.ip is related to the two-scale ansatz 

x(t, 77) = eX(e 3 t 7 e(?y + t)) (1.2) 

where r = e 3 t and y = e(j] + 1) denote the macroscopic time and space, respectively. The function X is 
the macroscopic configuration and for fixed r it takes values in P = L 2 (R; dy). The scaling parameter 
e > is assumed to be small and bridges the two appearing scales. 

The standard approach for model reduction works as follows: We plug the two-scale ansatz (|1.2[) into the 
microscopic law of motion (jl.ip , use formal expansions with respect to e and equate the terms of leading 
order. For the example at hand one easily derives 

2X T y — ~Xyyyy + XyXyy, (l"3) 

which is a KdV equation for X y . As already mentioned, this standard approach works very well but 
in general it is not clear at all whether the derived macroscopic equation has its own Lagrangian and 
Hamiltonian structures. 

We proceed with the Lagrangian and Hamiltonian two-scale reduction for the Boussinesq example in 
order to illustrate the difficulties that may arise in the general setting as well as the proposed solutions. 
On the one hand, the microscopic Lagrangian C for (jl.ip is given by £ = K, — V with kinetic energy JC 
and potential energy V given by 

K{x t ) = J \x t 2 &ri, V{x)= J \x 2 + \x m 2 + ^x, 3 dr,. (1.4) 

R R 

Identifying the momenta 7r = d Xt C with the velocities x t we find that the microscopic Hamiltonian H. 
equals the energy £ = K, + V. In particular, the microscopic law of motion (jl.ip equals the Eulcr- 
Lagrange equations to C, and is moreover equivalent to the canonical equations to TL, which correspond 
to the symplectic form 



-1 

1 



(1.5) 



with 1 being the identity map Q — > Q. On the other hand, the KdV equation (|1.3|) is the Eulcr-Lagrange 
equation to L rod (V, X T ) = K rcd (V, X T ) - V rcd (V) with 

K red (V, X T ) = J X T X y dy, V red (V) = J \{Xyyf + \{X y fdy. 

R R 

Since L red depends linearly on the macroscopic velocities X T , the reduced macroscopic Hamiltonian 
structure is non-canonical. In fact, the Hamiltonian M red equals the potential energy V rcd and the 
symplectic structure 

<r rcd (V, X) = f XX y dy (1.6) 
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is a skew-symmetric 2-form on P. Consequently, the macroscopic law of motion is given by cr rcd (A T , •) = 
dH rcd , which is a dynamical system on P, and not on TP or T*P. 



In order to describe how the microscopic Hamiltonian structure reduces to the macroscopic one, we regard 
the two-scale ansatz (|1.2j) as a time dependent transformation T con (i, e) : X G P — > Q with parameter e. 
Its canonical lift T ve i(£, e) : TP — ► TQ to the corresponding tangent bundles reads 

T ve i(i, e) : (X, X T ) ^> (x, x t ), {x, x t ){r}) = (eX, e 4 X T + e 2 X v ){ei 1 + et). (1.7) 

This transformation comprises the crucial ingredients of our approach: For fixed e > this transformation 
is exact, this means invertible, but describes explicitly how the macroscopic structures depend on e. 
Therefore, one can read-off the effective structures from the leading order terms in e. 
Applying the inverse transformation of (|1 .7|) to the energies from ()1.4j) we find 



K(e, X, X r ) = e 3 J \{e 2 X T + X y f dy, V(s,X) = e 3 ± J X y 2 + e 2 X yy 2 + \e 2 X T 3 dy. 

R R 

Both transformed energies are of order e 3 . However, the transformed Lagrangian L is of order e 5 , since 
the terms of order e 3 vanish due to cancelation via L = K — V. Thus, we find 

L(e, X, X T ) = e 5 U cd (X, X T ) + 0{e 7 ). 

The transformation of the Hamiltonian structure is not so simple, since the transformation (|1.7[) involves 
a moving frame. The macroscopic Hamiltonian H, i.e. the Legcndre transform of L, is given by 

H(e, X, X T ) = E(e, X, X T ) - I(e, X, X T ) = e 5 H rcd (X, X T ) + 0(e 7 ). 

Here, E = K + V is the transformed energy and I is the transform of X, where I is the conserved quantity 
related to the moving frame by Noether's Theorem: 

X(x, x t ) = j x t x v drj, I(e, X 7 X T ) = e 3 J (e 2 X T + X y )X y dy. 

R R 

We conclude that the transformation (|1.7p provides both the Lagrangian and the Hamiltonian for (|1.3[) 
to leading order e 5 . Moreover, it can be shown that the symplcctic form (|1.5p . considered as a 2-form on 
the tangent bundle TQ but not on the cotangent bundle T*Q, transforms into 

-29, \ T f -1 
o / tu ^ 1 

which equals (|1.6|) to leading order e 5 . Finally, the KdV equation is invariant under shifts in the y- 
direction, and this symmetry gives rise to the conserved quantity 



F ed (X) = J X 2 dy, 



P: 

which turns out to be the lowest order expansions of K(e, •) and V(e, •), namely 

e 3 ¥ cd (X) = 2 K(e, X, X T ) + 0(e 5 ) = 2 V(e, X, X T ) + 0(e 5 ) = I(e, X, X T ) + 0(e 5 ). 

We conclude that the terms which vanish due to cancelation correspond to a macroscopic integral of 
motion. 

As a second motivating example we study the macroscopic evolution of a modulated pulse in the Klein- 
Gordon (KG) equation 

x u = x m - %(x) (1.8) 
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with x = x(t, 77), rj € R, and nonlinear on-site potential <&o- A modulated pulse is an (approximate) 
solution which satisfies the ansatz 

x(r), t) = eA(e 2 t 1 er, - ect)e ±{ult+e ^ + c.c. + 0(e 2 ). (1.9) 

Here c.c. denotes the complex conjugate, the frequency lj and the wave number 9 are fixed parameters, 
and c is the moving-frame speed. The plane waves e 1 ^ ^ in (| 1 .9|) describe an oscillatory micro structure 
whose amplitude A is modulated on the macroscopic scale r = e 2 t and y = e(r) — ct). 

A first necessary condition for (|1.9p to yield approximate solutions is that cu and satisfy the dispersion 
relation uj 2 = 2 + $0(0) and c = —a/ is the associated group velocity. Moreover, the complex- valued 
amplitude A must satisfy the nonlinear Schrodingcr (nlS) equation 

2uj±A t = QiAyy - q 2 \A\ 2 A, 

where the constants g\ and qi can be computed explicitly. The validity of this macroscopic model has 
been proven rigorously in [KSM92] on the level of the equation of motion. 

As in the Boussinesq example, both the microscopic and macroscopic models have Lagrangian and 
Hamiltonian structures and so we are interested in the question how these are related to each other. The 
new feature in this example is the presence of microscopic oscillations and the key idea is to introduce an 
additional one-dimensional, periodic phase variable <fi € T 1 ~ [0, 2ir]. This new degree of freedom enables 
us to find a suitable two-scale transformation such that all (transformed) oscillations are confined in the 
phase direction <fi. This suggests the two-scale ansatz 

x(t, n, (j>) = eX(e 2 t, e(r? - ct), cf> + ujt + 6r/), (1.10) 

which is similar to p.9[) but gives rise to an invertible two-scale transformation. 



The introduction of </> can be viewed as an embedding of the microscopic system, such that ([1.8)1 becomes 

X tt {t, T), <j>) = X m {t, T), <j>) - & (x(t, T), <j>)). 

This embedding does not affect the microscopic dynamics, since </> appears just as a parameter. The 
embedded system has Lagrangian C = K, — V and Hamiltonian H = £ = K + V with 

1^2 1 1 1 \ _ / 12 



K(x t )= J \xi&n&<j>, V(x)= j ±x; + $o(x)dnd4> (1.11) 

RxT 1 RxT 1 

and corresponds to the symplectic form ()1.5p . Moreover, we find two continuous symmetry groups related 
to shifts with respect to rj and <f>, which by Noethcr's theorem correspond to the conserved quantities 
(integrals of motion) 

l spaC c{x, x t ) = J x t x 7] drid(j), l phasc (x, x t ) = J x t x^drjdcj). (1-12) 

RxT 1 RxT 1 

The second integral of motion arises only due to the embedding but plays a prominent role in the two- 
scale reduction, since it is needed for the derivation of the macroscopic Hamiltonian. In fact, the moving 
frame in (|1.10[) involves drifts in space and phase direction and the associated integral of motion reads 

C ^spacc -^phasc ■ 

Like for the Boussinesq example, we can use the transformation (|1.10|) and our general approach described 
below in order to derive the Lagrangian and Hamiltonian structures for the nlS equation directly from 
their microscopic counterparts. It comes out, that the leading order terms determine the microstructure, 
the next-leading order terms give the moving frame speed, and finally, the next-next leading order terms 
provide the macroscopic law of motion. This will be explained in detail within 
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1.2 General approach to Lagrangian and Hamiltonian two-scale reduction 

The concepts arising in the above examples can be generalized to the following abstract framework for a 
Lagrangian and Hamiltonian two-scale reduction. 

The first step concerns the embedding of the microscopic system. We have seen above that the treatment 
of models with microstructure requires the introduction of new phase variables 4>. Moreover, for discrete 
models like chains we replace the particle index j £ Z by a continuous variable r\ G R. In all cases 
this embedding does not change the microscopic dynamics, but it gives rise to new continuous symmetry 
groups and hence to additional integrals of motion which contribute to the macroscopic Hamiltonian. 

In what follows we always consider the Lagrangian C of the embedded system which is defined on the 
tangent bundle TQ of the microscopic configuration space Q. Then there exists an equivalent Hamiltonian 
structure on the cotangent bundle T*Q corresponding to the canonical symplectic form. However, for the 
reduction step explained below it is essential to consider a Hamiltonian TL as well as a symplectic form 
a both of which are defined on the tangent bundle TQ. To this end we pull back the canonical structure 
from T*Q to TQ via the fiber derivative of C. This will be discussed in detail in £12.11 

The most important step in any two-scale reduction is the transformation of the embedded system. 
For this purpose we introduce two-scale transformations by composing elementary building blocks such 
as (weak) symmetry transformations, moving-frame transformations, and scaling transformations. The 
first two building blocks are well understood in classical mechanics, whereas our concept of scaling trans- 
formations seems to be new, since it involves the scaling of space and time. The starting point for any 
scaling transformation is a map <S con : Q — > P bridging the microscopic and the macroscopic configuration 
spaces Q and P. The definition of such a map involves only the scalings of the space coordinates, but its 
lift S vc \ : TQ — > TP to the tangent bundles takes into account also the time scaling. 

Two-scale transformations are in the heart of any two-scale reduction, because they provide a macro- 
scopic Lagrangian L, a macroscopic Hamiltonian H, and a symplectic form a (all defined on TP), which 
depend explicitly on the scaling parameter e. 

The reduction step starts with the formal expansions of the transformed Lagrangian and Hamiltonian 
structures with respect to the scaling parameter e, i.e., 

L(e) = e k (L + eLi + e 2 L 2 + ...), 

and 

H(e) = e K (H + eHi + £ 2 H 2 + ...), cr(e) = e K (er + ea x + e 2 er 2 + ...). 

A key feature of our approach is the Principle of Consistent Expansions which will be proved in £12.11 
and guarantees that (H;, cri) is the Hamiltonian structure corresponding to the Lagrangian Lj. For this 
principle to hold it is crucial to consider the Hamiltonian structure on the tangent (and not on the 
cotangent) bundle. 

For some examples the leading order Lagrangian Lo is non-degenerate. Then the effective macroscopic 
model is completely determined already by the leading order terms. However, whenever the two-scale 
ansatz involves an oscillatory microstructure the leading order terms turn out to be degenerate in the 
following sense: The leading order Lagrangian Lo is quasi-stationary, i.e., it does not depend on X T , and 
this implies Ho = — Lo and er = 0. Moreover, there exists a sub-manifold Pq of P such that the gradient 
of Lo vanishes on P$. In this case we restrict L — Lo and H — Ho and cr to TPo, and derive the effective 
macroscopic model by expanding the restricted structures. 

The reduction procedure concerns the convergence of Lagrangian and Hamiltonian structures as e — > 0, 
but this does not necessarily imply the convergence of solutions. Therefore each reduced model must 
be justified. In the general setting the justification problem turns out to be very subtle and is not ad- 
dressed in this paper. However, for all examples presented here we discuss the corresponding justification 
problem after having derived the reduced Lagrangian and Hamiltonian structures. We also refer to the 
surveys |Mie02([GH M06, SU07] and to |Mie08| for an abstract theory using T-convergence for Hamiltonian 
systems. 



6 



The abstract framework for the two-scale reduction will be developed in detail within §2 where we prove 
the transformation rules for Lagrangian and Hamiltonian structures and discuss the reduction procedure 
in the various cases. Finally, in §2]wc apply this method to several micro-macro transitions for the atomic 
chain. 

1.3 Two-scale reductions for the atomic chain 

The nonlinear atomic chain consists of identical particles with unit mass. The atoms are coupled to a 
background field by the on-site potential $o and nearest neighbors interact via the pair potential $1. The 
microscopic dynamics is governed by Newton's equations 

Xj(t) = ^(x j+1 (t) - X j(t)) - SifoCt) - xj-xit)) - & (xj(t)), (1.13) 

where j £ Z is the discrete particle index and Xj(t) e R denotes the displacement of the j-th particle 
at time t. For <I>o = and an-harmonic $i we obtain the Fermi-Pasta-Ulam (FPU) chain, while Klein- 
Gordon (KG) chains correspond to harmonic 4>i but have an-harmonic $o- 

A general micro-macro transition for the atomic chain is related to the two-scale ansatz 

x j {t)=e a X(e fi t, e(j-ct)) 

with macroscopic time r = e^t, macroscopic particle index y = e(j — ct) and macroscopic configuration 
X. Notice that y is assumed to be a continuous variable and can be interpreted as the coordinate of a 
macroscopic material point. 

In the example part §2] we study the following micro-macro transitions and discuss how the Lagrangian 
and Hamiltonian structures that correspond to the effective macroscopic equations can be derived directly 
from the Lagrangian and Hamiltonian structure of the atomic chain. To this end we embed the atomic 
chain (|1.13|) into a microscopic system with continuous particle index r\ £ R, see §3.11 

Quasi-linear wave equation In §3.21 we consider the FPU chain and rely on the two-scale ansatz 

xj(t) = e~' l X(et, sj), (1.14) 

which has no moving frame and corresponds to the hyperbolic scaling r = et and y = ej. In this case the 
macroscopic evolution satisfies the nonlinear wave equation 

d TT X - dy($[(d V X)} =0. 

KdV equation The second example, see £13.31 concerns the passage from FPU chains to a KdV equation 
by means of a two-scale ansatz similar to (|1.2[) . 

Modulated pulses and the nlS equation In analogy to the second motivating example, in £13.41 we 
study the macroscopic evolution of a modulated pulse in the KG chain. Similar to above, the two-scale 
ansatz reads 

Xj(t) = eA(e 2 t, ej - ect)e i{uJt+e ^ + c.c. (1.15) 

and the macroscopic dynamics is described by an nlS equation. The only difference as compared to the 
case of the continuous KG equation l|1.8p concerns the dispersion relation leading to different coefficients 
in the macroscopic equation. 
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Three-wave-interaction The fourth example, see ij3.5l is the most involved one and describes how 
three modulated pulses interact if they are in resonance. This gives rise to the following ansatz 

3 

x j {t)=e^2A n (et, ej)e i( "" t+e " j) +c.c. (1.16) 

n=l 

with three phases <j) n = oj n t + 9 n j and three amplitudes A n . All pairs p n = (8 n , u n ) satisfy the dispersion 
relation of the KG chain and are coupled via the resonance condition pi + P2 + Vz = in T 1 x M, 
where T k = R fe /( 27r z) fc i s the fc-dimensional torus. This resonance condition shows that we have only two 
independent phases. Moreover, the amplitudes are coupled on the hyperbolic scaling r = et, y = ej via 
the three-wave-interaction equations 

(2u x 0\ (A x \ huwf x \ (AA /M 3 \ 
i 2oj 2 0kU 2 =i 2w 2 ^ \d y \A 2 - $™(0) AiA 3 . (1.17) 
\ 2u 3 J \A 3 J \ 2cj 3 lj 3 J \A 3 J \A 1 A 2 ) 

Finally, in £13.61 we present further examples for micro-macro transitions in the atomic chain. Although 
they fit into the general framework they are postponed to a forthcoming paper as they display additional 
complications. 

2 Lagrangian and Hamiltonian two-scale reduction 

In this section we describe the general framework for the two-scale reduction of Lagrangian and Hamil- 
tonian structures and present our abstract results concerning two-scale transformations and the problem 
of model reduction. Since we are mainly interested in Hamiltonian PDEs and lattices we assume that 
the microscopic configuration space Q is a function space. Moreover, for simplicity we suppose Q to be 
a Hubert space (usually some L 2 -space) with inner product (■, •). 

As a prototypical example for a microscopic Lagrangian we consider a normal system, where the 
Lagrangian C is the difference of quadratic kinetic energy K and potential energy V. More precisely, a 
normal Lagrangian C satisfies 

C(x,xt)=JC(xt)-V(x), K{xt) = \{x t ,Mx t ) (2.1) 

with symmetric mass matrix M : Q — > Q. However, our approach is not restricted to normal systems but 
can be applied to all microscopic Lagrangian structures. We start with some general remarks regarding 
Hamiltonian structures. 

2.1 Hamiltonian structures for given Lagrangian 

In classical mechanics we have (at least) two possibilities to introduce a Hamiltonian structure for a given 
Lagrangian C : TQ — > K, where a Hamiltonian structure consists of both a Hamiltonian (function) and a 
symplectic form. 

The standard approach is related to the canonical Hamiltonian structure on the cotangent bundle 
T*Q. For its definition we consider the Legendre transform TL : TQ — * R of C which is defined by 
Ti.(x, Xt) = (tt{x, x t ), x t ) — C{x, Xt). Here, 7r(x, x t ) = $£\ x (xt) is the canonical momentum associated to 
Xt and is determined by the fiber derivative of the Lagrangian C. This fiber derivative is given by 

: TQ — ► T*Q, (x, x t ) i-> (a:, d Xt C(x, x t )) = (x, tt(x, x t )). 

In the next step we replace the velocity Xt by tt, assuming this is possible, and rewriting TL : TQ — > R in 
terms of x and ir we obtain the canonical Hamiltonian 

W:T*Q^R, H(x, tt) = W(ff£ _1 (a;, tt)). 

The Lagrangian equation to £, i.e. -^tt(x, x t ) — d x £(x, Xt), is equivalent to the canonical equations 
x t = d^TL, TT t = —d x Ti, which can be written as 

a can |-(z 4 , •) = my,-). 



<s 



Here, z = (x, tt) G T*Q, and cr can denotes the canonical symplectic form on T*Q given by 

Pcan\z{z, z) = (iT, x) - (tT, x) , 

with z = (x, 7r) and z = (x, tt) being two independent tangent vectors from T|-T*Q. 

The second Hamiltonian structure lives on the tangent bundle TQ and consists of the Hamiltonian 
H : TQ — * R and a non-canonical symplectic form cr £ A 2 (TQ) defined as the pull-back of cr can via 
i.e. cr = cf can . This means 

<t\ z {z, i) = (i?7rU(i), x) - (Dtt\ z (z), x), (2.2) 

where z = (x, Xt), z,i 6 T^TQ, and i?7r| 2 is the linearization of tt in z. Assuming that is differen- 
tiablc, it can be shown, see [AM78] for a proof, that the Lagrangian equation for C is equivalent to the 
Hamiltonian system 

a\ z (z t ,-) = dH\ z (-). (2.3) 

Remark 2.1. The symplectic form a can be identified with a family 

£ : TQ -> Lin(QxQ, QxQ) 

of skew-symmetric and operator-valued matrices such that cr| z (z, i) = (S| a i, z)q x q for all states z = 
(x, Xt) E Q x Q and arbitrary tangent vectors z, z £ Q x Q. The components Sy, i, j = 1,2, of S 
are linear operators Q — > Q and satisfy ST = — S^. Consequently, the Hamiltonian system (|2.3p is 
equivalent to 



SI 



_ ( d x TL{x, x t ) 



(x ' xt) d< UJ ~ la at W(x, x t ) 



Example 2.2. On each Hilbert space Q we can define the metric Lagrangian £ me t by £ mc t(x, Xt) 
i(x t , x t ). This implies H me t = ^mot and 



Cmet 



\(x,x t )({z, X t ), (X, X t fj = (± t , X) - (± t , X), Emet^a.xt) = ^ 



where 1 denotes the identity map Q — > Q. In what follows we refer to cr mo t and E met as the metric 
symplectic form on TQ. Moreover, for a normal Lagrangian with (|2.ip we find 7i(x, x t ) = IC(xt) + V(x) 
as well as 



-A/ 
M 



A/XL 



where we used M = M . 

The tangent-bundle approach to Hamiltonian structures is more general than the canonical one via the 
cotangent bundle, because it works even if the map Xt i— > tt is not invertible, but has the disadvantage that 
the symplectic form a depends explicitly on the Lagrangian C. Consequently, the Hamiltonian equations 
on TQ do not arise in canonical form. For the examples from §1.11 and ^1.31 we find 7r(x, Xt) = Xt so 
that the Hamiltonian structures on TQ and T*Q seem to be equal. However, both structures transform 
differently under scaling transformations, see Principle 12.31 and £12.21 

For a first motivation why we prefer the tangent-bundle and avoid the cotangent-bundle structures, 
let us study trivial scalings: Given a Lagrangian C on TQ, we consider the scaled Lagrangian C e = eC, 
where e > is some artificial small constant. The scaling of TL is given by TL e = (d Xt £ e , Xt) — C £ = eH, 
and similarly we find a e = eo~. On the other hand, the standard (canonical) approach applied to C £ yields 
W e (x, 7r) = eH[x, e _1 7r), and the canonical equations 

x t = +d 7T H E (x, 7r) = +d Tr Ti(x, £ _1 7r), 7T t = —d x H e (x, tt) = —ed x H(x, e" 1 ^) (2.4) 

again correspond to cf can , which does not depend on e. Of course, as long as s is fixed, both formulations 
are completely equivalent, since we can replace tt by eir in (|2.4p . However, if we try to identify leading 
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order dynamics by expansions in powers of s we obtain very different results. In fact, C e , TL e and <j e scale 
in the same way and, hence, e drops out in both the corresponding Lagrangian and Hamiltonian equations 
on TQ. On the other hand, for a normal Lagrangian, as given in (|2.ip . we find TL e = (tt, A/ _1 7r) +eV(x) 

and the formal expansion of H e gives if red (7r) = -^(ir, M _1 7r) as "leading order" Hamiltonian on T*Q. In 
particular, the corresponding canonical equations x t — e~ 1 M~ 1 tt and 7r t = do not recover the original 
dynamics. 

More generally, the key difference between tangent and cotangent Hamiltonian structures is related to 
the following Principle of Consistent Expansions: 

Principle 2.3. Suppose that the Lagrangian C obeys a (formal) expansion in powers of a parameter e, 
i.e., 

C(e)^s K (C + eC 1 +e 2 C 2 + ...). (2.5) 
Then the Hamiltonian structure on TQ obeys a corresponding expansion 

H(e) = e K (H + eHi + s 2 H 2 + ...), <j(e) = e K (tr + e<r x + e 2 o 2 + ...) 
and all expansions are consistent. This means, for each order e 1 we have 

Hi = (d^jCi, x t ) - A, o~i = a can . 
Proof. Since the fiber-derivative operation acts linearly on the Lagrangian we find 

$C = e K ($£o + e%d + e 2 $C 2 + ...) 
and this implies both the existence and consistency of the expansion of the Hamiltonian structure. □ 

The validity of Principle 12.31 is a remarkable property of the Hamiltonian structure on TQ and has 
no analogue on T*Q. In fact, (|2.5p implies a consistent expansion for the canonical momentum ir, i.e. 
7r(e) = e K (no + £7Ti + e 2 ir 2 + ...) with TTi = d Xt £i, but replacing xt by ir we normally end up with a 
non-consistent expansion for the Hamiltonian TL on T*Q. 

In the context of this paper we do not apply Principle 12.31 to the microscopic Lagrangian or Hamiltonian 
structures, since usually these do not depend on scaling parameters. However, the two-scale transforma- 
tions introduced in £12.21 strongly depend on e and so do the transformed Lagrangian and Hamiltonian 
structures. Thus, for the purpose of model reduction the tangent framework turns out to be very con- 
venient as it provides the consistency of the Lagrangian and Hamiltonian structures for all powers of 
e. 

2.2 Exact two-scale transformations 

As mentioned in the introduction, any micro-macro transition relies on an exact two-scale transformation 
which obviously changes the Lagrangian and Hamiltonian structures. All of the two-scale transformations 
considered in this paper are superpositions of elementary building blocks, namely 

1. (weak) symmetry transformations, 

2. moving-frame transformations, 

3. scalings of space and time coordinates. 

In this section we aim to describe how each of these building blocks transforms the Lagrangian and 
Hamiltonian structures on TQ. The concepts of symmetry and moving-frame transformations are well 
established in the theory of Hamiltonian systems, but since they are usually studied on the cotangent 
bundle we start with the reformulation of standard results. 
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2.2.1 Linear transformations 

Let Ton : Q — > Q be a linear isomorphism between Q and another Hilbert space Q with inverse Ton '■ 
Q — > Q. The canonical lifts of Ton and Ton to the tangent bundles are denoted by %, a \ : TQ — ■> TQ and 
T vc i : TQ — > TQ, respectively, and satisfy 3^ e i(a;, Xt) = (Ton^, T con x t ) as well as T^i = (Tv e i) . 

Remark 2.4. In what follows we use the inverse transformation T ve \ in order to pull back forms from Q 
(the pull-back with respect to T ve \ is the push-forward with respect to T ve {). In particular, we pull back 
functions T (0-forms) and symplectic forms a (2-forms). The images under this operation are denoted 
by T = (T vc \) T and a = (T vo i) cr, and satisfy 

F{z) = T(% c \z), o\s{z, z) = o-\f vei ~(f ve iz, TyeLZ), 
where z G TQ and z, z G T\{TQ. 

Theorem 2.5. Let C = C o T vc i 6e i/ie transformed Lagrangian and {TL, cr) i/ie associated Hamiltonian 
structure on TQ. Then, H. and a equal the transformed Hamiltonian and symplectic form, respectively. 

Proof. Let z = (x, xt) be given, and z = T ve \z = (x, it). The definition of C implies 

tt(z) = d it C{z) = (T con ) Tr(f ve iz), i.e. (n(z), -}q = (ir(f ve iz), f con -) Q , (2.6) 

where (T on ) is the adjoint operator to T con . From this identity we derive 

H(z) = (7r(5), x t )q - C(z) = (ir(f ve iz), f CO nXt) Q - C(% e \z) = H(% e \z), 

as well as 

(D*U(~z), ■>£ = (Dn\ fmiS (f v J), f con -) Q (2.7) 
for all z G T|jTQ. Finally, combining j2J| with ([221) for z = %o\z we find 



°\f ATvelZ, f ve iz) = (DTv\j- -(f ve iz), T con £) ~ (Dn\f ~{% e \Z), f con x) f 



= (D7t\s{z), x)q - {Dtt\ & {z), x)q = 5\&{z, z), 
and the proof is complete. □ 
Corollary 2.6. The following equivalences are satisfied: 

1. A curve t i— > x(t) G Q satisfies the Lagrangian equation to £ if and only if the transformed curve 
1 1 — ► x(t) = T con x(t) G Q satisfies the Lagrangian equation to C 

2. A curve 1 1— ► z(t) G TQ satisfies the Hamiltonian equation to (7i, cr) «/ and emZi/ if the transformed 
curve 1 1— > z(r) = Tv e iz(i) G TQ satisfies the Hamiltonian equation to {TL, a). 



2.2.2 Weak symmetry transformations 

We introduce the notion of a weak symmetry transformation which describes a certain class of linear 
and invertible operators from Q into Q. Although both the Lagrangian and Hamiltonian structures 
are not invariant they behave nicely under such transformations. In particular, each weak symmetry 
transformation changes neither the fiber derivative of C nor the symplectic form cr. 

Definition 2.7. A weak symmetry transformation (with respect to the Lagrangian C) is a linear isomor- 
phism Tcon : Q Q with the following properties: 

1. T con is unitary, this means (T con x, x) = (x, T con x) for all x, x G Q . 

2. The canonical momentum 7r = d Xt C commutes with Ton in the sense that 

7r(T vc iz) = T con 7r(z) (2.8) 

holds for all z = (x, x t ) G TQ . 
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Moreover, T con is called a symmetry transformation if it respects the Lagrangian, i.e., C = C in the sense 
of Theorem O 

Remark 2.8. (i) Unitarity implies T con = T c ' on and (x±, x%) = (T con xi, T con X2) for all x\,x% € Q. 
(ii) Condition (|2.8p is equivalent to it = ft, see (|2.6p . and this implies 5"£ = and <r = a. (Hi) Each 
symmetry transformation satisfies TL — TL and cr = a. (iv) C = C is sufficient for (|2.8D . 

Example 2.9. Let Q = L 2 (RxT 1 ; dr/d</>) be the Lebesgue space of functions x depending on r\ £ R and a 
periodic phase variable G T 1 = [0, 27r], and let the unitary operator 7^ on be defined by (7^ on x)(r/, 0) = 
a;(?7, + sqT]) for some so- The Lagrangian C of the embedded Klein-Gordon equation, cf. (|1 . 1 1|) . is not 
invariant under the action of T con as the differential operator d n transforms into + sgd^,. However, 
the condition (|2.8p is satisfied. 

2.2.3 Groups of symmetry transformations 

The concept of symmetry groups is well established in mechanics and mathematics and plays a fun- 
damental role in the analysis of Hamiltonian systems. Here we summarize the definitions and basic 
properties. 

Definition 2.10. A (weak) symmetry group (with respect to the Lagrangian C) is a one-parameter family 
of s i— ► T con (s), s € R, of (weak) symmetry transformations that satisfies the following properties: 

1. The family is a group of unitary transformations, i.e., 7^ on (0) = IdQ_,Q and 

%on(s+s) = T con (s)T con (s), %on(s) = (T con (s) ) ~ 1 = (T con (s))' = T con (-s) 
for all s, s e R. 

2. The generator A CO n with A con x = lim s ^o s~ 1 (T con (s)x — x) is defined on a dense subset of Q. 
Consequently, the group s i— » T vc \(s) is generated by A vc i = A CO n X Aon- 

Remark 2.11. If C is invariant under the action of a symmetry group Noether's Theorem provides the 
integral of motion 

l(x, x t ) = (nix, x t ), AconX), n(x, x t ) = d Xt C(x, x t ), (2.9) 
i.e., X is conserved for any solution to the Hamiltonian equation (|2.3|) . 

Example 2.12. Let Q and C be as in Example 12.91 and for fixed 770 G R, </>o E R and all s G R 

let (T con (s) x)(r], <j>) — x(r) + sr)o, tfi + stfto). Then, s 1— > T con (s) is a symmetry group with generator 
Aon = Vad v + 0o(?0 and integral of motion 2 = r]ol spa , ce + ^oZphase, where T spa ce and X phase are given by 

una). 

Lemma 2.13. Each (weak) symmetry group satisfies a\ z (A ve \z, ■) = cLT| z (-) for all z G TQ andX from 
(1231) . 

Proof. For given z = (x, Xt) € TQ consider the curve t 1— > z(t) = T vc \(t)z € TQ, and its image under 
that is £ 1 — ^ z(t) = (x(t), 7r(t)) with x(t) = T con (t)x and w(t) = n(z(t)). Moreover, let z = (x, n) 
be an arbitrary tangent vector in T|j( )T*Q. Condition (|2.8|) implies (7r(i), x) = (7r(7^ e i(t)z(0)), f) = 
(7r(0), 7^ on (— t)f) and differentiation and evaluation for t = yield (714(0), x) = — (7r(0), A CO nx). This 
identity and the definition of ef can provide 

an I zq [(A (."■(..in 

x(0), 7T t (0)), (x, IT) ) 
= (7T, A con x(0)) - (7T t (0), Aoni) = (7T, Ao„s(0)) + (tt(0), Aoni)- 

Moreover, for 7r) = (tt, Aon^) we find dX|-( ) (z) = (77", Aon^(0)) + (tt(0), Aon^) and, hence, 
Ocan|z(o) (^t(O)j ') = dT|z(o)(-). Finally, pulling back this identity via and using z t (0) = A ve iz(0) 
completes the proof . □ 
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2.2.4 Moving frames 

In this section we consider a time-parametrized family of invcrtible transformations of the configuration 
space M con (t) : Q — > Q and denote the family of inverse transformations by .M con (/j). Taking into 
account the time dependence we shall lift this transformation to the tangent bundle as follows: Each 
time-parametrized curve t t— ► x(t) in Q provides a lifted curve t i— > (x(t), Xt(t)) in TQ, where Xt(t) 
denotes the tangent vector at time t, i.e. Xt(t) = ^x(t). Consequently, the lift of the transformed curve 
t 1— > x(t) = M CO n(t) x(t) is given by 

1 1 * (x(t), x t {t)) = (x{t), &x t (t)) = (M con {t) x(t), M con {t) x t {t) + (f t M con (t)) x(t)), 

and we read-off the definition of Ai ve \(t), that is 

A4 ve i(t) : (x, x t ) i— > (i, it ) = {M con (t) x, M con (t) x t + (f t M con (t)) x) . (2.10) 

The transformation of a Hamiltonian structure under a time-dependent transformation is in general quite 
complicated. Therefore we solely discuss time-dependent transformations that are related to moving 
frames. 

Definition 2.14. The transformation M C on(t) is called a moving-frame transformation (with respect to 
the Lag rangiciii >C) if it is related, to a symmetry group s ^- > / 7^on(s) via A4 con (t) = T con (t). This implies 
-Mcon(O) = Hq^q and M con {t) = M con {-t) for all t. 

Example 2.15. Let Q be as in Example 12. 9[ and let £ mc t be the metric Lagrangian from Example 
12.21 Obviously. £ mo t is invariant under Galilean transformations (t, 77, <f) 1— > (i, ry, </>), where fj = T] — ct 
denotes the spatial coordinate in the moving frame. The corresponding time-dependent transformations 
■Mcon(t) '■ x i— > x and M vc \(t) : (x, Xt) <— > (i, it) can be read-off from the identification x(t, 77, (f>) = 
x(t, rj — ct, 4>) and are given by 

i(?7, 4>) = x(r) + ct, 4>), i t (?7, <j>) = x t {r\ + ct, 4>) + cx v (r) + ct, 4>), 

where x v abbreviates the derivative of x with respect to rj. The underlying symmetry group (T con (s)x)(r), <f>) = 
x(r/ + cs, 4>) has the generator _4 con = cd v and the conserved quantity T(x, xt) = c J R xt x v drj £ R. The 
lifted transformations A4 vc \(t) and T ve \(t) are really different because of T vc \(t) = T con (t) x T con (t). 

For moving-frame transformations wc can decompose the lifted map as follows: Definition 12 . 141 implies 
jt.-M C on{t) = A CO n%on(t), and using (|2.10[) we conclude that 

M ve i{t) = TZ VC \ o % c i(t) with 7?. vc i : {x, x t ) i-> (x, x t + Aon x), 
M ve i{t) = % c \(t) o 7?. V ci with TZ vc i : (i, x t ) h-> (x, x t - A con i). 

In what follows we denote by C the transformed Lagrangian, i.e. C(t) = C o A4 vc \(t), and with (H, a) 
the Hamiltonian structure corresponding to C. However, since the Legendre transformation does not 
commute with M vc i(t) we can not expect Ti., that is the Legendre transform of C, to equal the transformed 
Hamiltonian. For this reason we identify Ti with E, and define £ = £0 M vc \(t). This notation is motivated 
by normal systems, see (|2.1[) . for which the Hamiltonian Ti equals the total energy £ = JC + V. Finally, we 
write I = Io A4 ve \(t), where T(z) — (tt(z), *4 CO n^) is the integral of motion associated to the symmetry 
group. 

Next we prove that all these quantities do not depend on time, as it is already indicated by the 
notation, and derive the transformation rules for the Hamiltonian structure. 

Theorem 2.16. Moving-frame transformations satisfy 

C = C o 1Z vc i, £ — £ o 72-veii T = X o TZ YC \. 

Moreover, we have 

H = £+l, a = (K vci )*cr, 
where (H, a) is the Hamiltonian structure associated to C 
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Proof. Let t be fixed, and for arbitrary z = (x, Xt) let z = (x, x t ) = M. ve \(t)z. Due to the invariance of 
C under T vo \(t) we have C = (C o T vc \(—t)) o 1Z VC \ = C o 7t ve i, and this implies 

7r(z) = d £t C(z) = d £i (C(x, x t - A coa xt)) = n(x, x t - A CO nx) 

so that 7r = 7r o 7t VG i. We conclude that $C = $C o TZ VC \ = (7vL vo i) and hence a = (R vc i) o. The 
unitarity of T con (t), the identity A CO nTcon (t) = ^on(i)-^con and Formula f2.8|) yield 

1(f) =I(M ve l(t)f) = (7T(M ve l(t)2), AonT con (t)x) = (ir(M ve l(t)z), f CO n(t)A con x) 

= (ir(T ve i(t)M ve i(t)z), A con x) = (irfeyeiz), A con x) = l(U ve iz), 

the desired result for X. Analogously with (£ + C)(z) — (tt(z), x) we find 

(£ + C)(z) = {£ + C)(M ve i(t)z) = (ir(f vel (t)K ve iz), f con {t)(x t - Aoni)) 
= (ir(R, ve iz), x t - Aconx) = (£ + C)(H ve \z), 

which implies the formula for £. Finally, 

H{z) = (tt(z), x t ) - C(z) = (7r(7l vc iz), x t ) - C(U ve \z) 

= (7r(7?. vc iz), UyelXt) + {^(Uyeiz), A con x) - C(jty e \z) = £{H vc \z) +I(jty e \z), 

and the proof is finished. □ 

M yel (t) 



H = {d Xt C, x t )-C 



TL = (d it C, x t ) - C 



n = £ 



M vei (t) 



H = £+I 



H 



Figure 2: £, £, TL and their transformed counterparts in a moving frame. 



The results of Theorem 12.161 can be reinterpreted as the transformation rule for Hamiltonian struc- 
tures, see Figured hi foctj o equals the pull-back of <r, and to obtain TL we pull back the sum of the 
Hamiltonian £ = TL and conserved quantity X. As a consequence we gain the following result. 

Corollary 2.17. The following equivalences are satisfied. 

1. A curve t t— > x(t) € Q solves the Lagrangian equation to C if and only if the transformed curve 
1 1— > x{t) = M con (t)x(t) € Q solves the Lagrangian equation to C. 

2. A curve t i— ► z(t) £ TQ solves the Hamiltonian equation to (TL, a) if and only if the transformed 
curve 1 1 — y z(t) = AA vc \(t)z(t) £ TQ solves the Hamiltonian equation to (TL, a). 

Proof. Since (TL, a) is the Hamiltonian structure associated to C it is sufficient to prove the equivalence in 
the Hamiltonian framework. Let z(t) = lZ ve \z(t) such that z(t) = Ai ve i(t)z(t) satisfies z(t) = T vc \(t)z(t). 
Now suppose that t ^ z(t) solves the Hamiltonian equation to (TL, a). This means 
&\z(t )(zt(t) Q , •) = dH\z(t ){') f° r arbitrary but fixed to, and Theorem 12.161 provides 

<r\z(t )(zt(to), = d£\ Hto) (-)+dX\ i{to) (-). 

By construction we have £t(to) = T V ei(to)zt(to) + A ve \z(to), and exploiting Lemma 12.131 we find 

o-\% Ql (t )z(t ){%e\{to)zt{t ), ■) = d£| Tvol( t o)z( t o) (-) = dW| Tvol(to)z(to) (-), 



and the invariance of L, TL and a under T ve \(to) (cf. Remark 1 2. 8 p shows that 1 1— > z(t) solves the Hamilto- 
nian equation to {TL, er). Finally, in order to establish the equivalence we argue in the reverse direction. □ 
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2.2.5 Scaling transformations 

The two-scale problems considered in $31 involve suitable scalings of space and time variables. We always 
suppose that there exist positive constants f3 and 7 such that r = e^t and y = e^rj. In particular, ^ = 
and = £ 7 are the scaling constants for time and space, respectively. 

The spatial scaling can be encoded in a linear and invertible scaling transformation S con : Q — > P that 
maps the microscopic configuration space Q into P, the space of all macroscopic configurations. In what 
follows P is always a Hilbert space, usually some £ 2 -spacc, with inner product ((■, ■)) and § con : P — > Q 
is the inverse to iS con . The elements of P are denoted by X and are functions of the macroscopic space 
variable y. 

Definition 2.18. A scaling transformation is a scaled isometry iS con : Q — > P, i.e., 

(x, x) = e^((S con x, S con x)) 

for some exponent \i and all x, x € Q. 

Notice that P does not depend on the scaling parameter e, whereas the transformations <S con and § con 
as well as the scaled Lagrangian and Hamiltonian structures will strongly depend on e. Nevertheless, for 
the moment e is an arbitrary but fixed parameter and hence we do not denote explicitly the dependence 
on e. 

Example 2.19. Let Q = L 2 (R; drj) and P = L 2 (M.; dy), and consider the two-scale ansatz x(t, rj) = 
e a X(e^t, e^rfj. In this case we have 

(S con x)(y) = e- a x(e^y), (§ con X)( v ) = e a X(s">n), 



{x, x)= x(rj)x(r]) dr] = e 2a / X(e 7 ?7)A > (e 7 r;) d?7 = £ 2q -T((X, X)). 



providing 



The transformation 5 con : Q — > P does not take into account the time scaling, since this is related to 
reparametrization of curves as follows: Let 1 1— > x(t) be any curve in Q with tangent vectors 1 1— > Xt(t), and 
let 1 1— > X(t) = S con x(t) be the transformed curve in P, which has tangent vectors 1 1— ► X t {t) = 4rX(t) = 
Scon Xt{t). In view of the time scaling we are not interested in X(t), but refer to the reparametrized curve 

r ^ X(t) = X(i(r)) = S con x(t{r)) 

with rescaled tangent vectors 

r ~ X T (r) - &X{t) - f S con x t (t(T)). (2.11) 

For this reason we denote elements of TP = PxP by (X, X T ) instead of (X, X t ). Moreover, we must 
take into account this reparametrization when defining <S ve i, i.e. the lift of S con to a map TQ — > TP. In 
fact, using (|2.11j) we find 

Svei = (x, x t ) h-> (X, X T ) = (S con x, e~ p S con x t ). 
Example 2.20. Using the notations from Examplc l2.19l wc obtain 

(S ve} (x, x t ))(y) = (e' a x, e~ a - & Xt) (e^y) , (S vel (X, X T ))(n) = (e a X, s a+f3 X T )(e"'r ) ). 

Following the proof of Theorem 12. 51 we derive the transformation rules for the Lagrangian and Hamil- 
tonian structures. To this end, let L = £o§ vd be the transformed Lagrangian and (H, er) the associated 
Hamiltonian structure on TP, i.e., HI is the Legendre transform of L and cr = (5 r L)*o r ca n, where <T C an is 
the canonical symplcctic structure on T*P. 

Theorem 2.21. We have H = TL o S ve i and cr = ^-cr, where cr = (S V oi)*c- 



15 



Proof. The definition of L implies ((U(Z), •)) = (7r(S vcl Z), ^S CO n->, where Z = {X, X T ) and II = <9 Xt L. 
We conclude that 

H(z) = «n(z), Xr)) -l(z) = (A^dZ), %s con z T ) - c(s vel (z)) = n(s vcl z) 

and ((DU\ Z (Z), •)) = (Dn\ SvclZ (S vc iZ), j^S con -) for all Z G T\zTP. Inserting this identity into the 
definition of cr, compare (|2.2p , we obtain 

*\ Z {Z, Z) = ((DIL\ Z (Z), X)) - ((DU\ Z (Z), Z)) 

= (£>7r| SvelZ (SvelZ), ^S co „X) - (Dn\ SvolZ (S vcl Z), %§conX) 

= ^(DTr\s velZ (E> ve iZ), Scon-^) — ^(-D7r|s vcl z(§veiZ), E> con X) 
= ^&\s vol z (^vc\Z, § vc \Z^j = ^g&\ z (z, Zj, 

which is the desired result for cr. □ 

The additional scaling parameter in the formula for cr appears naturally due to the reparamctrization 
of curves. More precisely, the microscopic Hamiltonian equation is equivalent to 

&\ z {i t Z, ■)=dH|,(.), 

but since here the solution still depends on t we reparametrize via 4? = 4| -p . 

Example 2.22. Let £ be a normal Lagrangian, cf. (|2.ip , and consider a simple time scaling t r = st 
with P = Q and the two-scale ansatz x(t) = X(et). Then 

Svoi = (l °j , L(X, X T ) = \e 2 {X T , MX T ) - V(X) with II(X, X T ) = e 2 MX T , 

and a simple calculation yields 

M{X, X T ) = \e 2 {X Tl MX T ) + V(X), cr = (£L) V can ~ e 2 M 

The pull-back of a via S ve i is given by 

C> = (S V el)V ~ S^ol ([ ~f) §vcl = £ (l ' 

and differs from cr by the factor e^ 1 . 

In what follows we refer to L and (H, cr) as the macroscopic Lagrangian and Hamiltonian structures, 
but we recall that microscopic and macroscopic structures are completely equivalent as long as e is a fixed 
but positive parameter. Consequently, we find the following transformation rules for solutions. 

Corollary 2.23. The following equivalences are satisfied. 

1. A curve t t— > x(t) € Q solves the microscopic Lagrangian equation to C if and only if the transformed 
and reparametrized curve t h- > X(t) — S con x(t(r)) € P solves the macroscopic Lagrangian equation 
to L. 

2. A curve t t— > z(t) G TQ solves the microscopic Hamiltonian system to (7i, a) if and only if the trans- 
formed and reparametrized curve t i— > Z(t) = S ve \ z{t(r)) £ TP solves the macroscopic Hamiltonian 
equation to (H, cr). 
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2.2.6 Two-scale transformations 

Since space-time scalings depend on the parameter e, from now on wc denote a scaling transformation 
and its inverse by iS con (e) and S con (e)j respectively. Consequently, both the macroscopic Lagrangian and 
Hamiltonian structures will depend on e, and thus wc write L = L(e), H = H(e) and cr = cr{e). However, 
we always choose the macroscopic configuration space P as independent of e. 

The two-scale transformations considered in Sj3]are compositions of a scaling transformation S con (e) : Q — > 
P, a moving-frame transformation M con (t) : Q — > Q and a symmetry transformation T con : Q — ► Q. More 
precisely, a general exact two-scale transformation T con {e, r) : Q — > P and its inverse T con (e, t) : P — ► Q 
are given by 

% on (e, t) = S con (e) o M CO n{t(e, r)) o 7^ on , T con (e, i) = 7^ o M con {-t) o § con (e). 

For convenience we parametrize forward and backward transformations by r and t, respectively, i.e., 

T coa (e, r(e, i)) o T con (e, t) = Id P ^ P , T con (e, t(e, r)) o T con (e, r) = Wq^q. 

Moreover, the lifted transformations are given by 

T vcl (e, r) = 5»d(e) ° M vc i(t(e, r)) o T vcl , T vel (e, t) = T^ 1 o M ve i(-t) o § vcl (e). 

Example 2.24. The KdV reduction relies on the scaling r = e 3 t, y = erj and the two-scale ansatz 

x(t, 7]) = eX (e 3 t, erj + set) , 

where x e Q = i 2 (K; d?y) and X E P = L 2 (R; dy). From this ansatz we can read-off directly the 
inverse two-scale transformation T con (e, t) = A4 con (—t) o S CO n(e), which consist of the inverse scaling 
transformation (S con (e) X)(r]) = eX(er]) and the inverse of the moving- frame transformation 

{M con (t)x)(ri) = x(r/ - ct). 

Moreover, X(x, Xt) = — d svajce {x, x t ) — —c J R Xt(j]) x^iyj) dr/ is the integral of motion associated to 
M con (t). 

For later purposes we prove two auxiliary results. The first lemma describes how to restrict Lagrangian 
and Hamiltonian structures to subspaces of P, and the second one allows us to compute X from S, the 
matrix-valued maps corresponding to cr and tr, respectively. 

Lemma 2.25. Let P C P be a closed subspace of P, embedded via a linear and continuous operator 
Icon : P P with canonical lift J vc i = J con x J con : TP TP. Moreover, let L = L o J vel be the 
restricted Lagrangian on TP and (H, cr) the associated Hamiltonian structure. Then, i = io J ve i and 
cr = (Jvel) c- 

Proof. The projector corresponding to J vc i is denoted by J VG i : TP -» TP and equals the adjoint of J ve i- 
Notice that J vc i ° Jvci = Id TP but ker (J vo i o J vcl ) D {0} for P C P. Besides this modification the proof is 
entire similar to that of Thcorcm l2.5l □ 

Lemma 2.26. Let e and t be fixed and suppose there exist two linear and invertible transformations 
S :TP^TQ with (S Z, S Z) = e^ l ((Z, Z)) for some \i and T :TP -^TP such that T vol (e, t) = S o T. 
Then, 

E|z = ^e^T'S- 1 J:\stz ST, (2.12) 

where T is the adjoint to T . 

Proof. Let Z £ TP be fixed and choose two arbitrary tangent vectors Z, Z £ T\zTP. Moreover, set 
z = STZ and z = STZ, z = STZ. The definition of cr and the linearity of T vcl (e, t) = S o T imply 
cr\z{Z, Z) = ^a\ z (z, z), and this gives 

{{J1\ Z Z, Z)) = f (S| 2 i, z) = % (SS-^USTZ, STZ) 

= %e»{{S- 1 n z STZ, TZ)) = %e»((T'S- 1 X\ z STZ, Z)), 
the desired result. □ 
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Remark 2.27. For £ = £ me t and S — S x S with S : P — ► Q we have S 1 ES' = S met , where S mo t and 
S mct correspond to the metric symplectic forms on TQ and TP, respectively, see Example 12.21 In this 
case (f2~T2l becomes S = ^e^T"E mot T. 

2.3 Reduction principles 

In this section we suppose that an exact two-scale transformation has already transformed the original 
microscopic system into a macroscopic one on TP, where P is a Hilbert space with inner product ((•, •)). 
As before, the macroscopic system has Lagrangian L(e), and the associated Hamiltonian structure on 
TP is given by (H(e), <r(e)). In the previous section we have shown how H(e) and <x(e) can be computed 
directly from their microscopic counterparts, but H(e) is always the macroscopic Legendre transform of 
L(e), and <r(e) equals (^h(e))*W can . 

In what follows we describe how the explicit dependence on e allows for a consistent model reduc- 
tion. As illustrated in a typical two-scale transformation provides an expansion of the macroscopic 
Lagrangian in powers of the scaling parameter e, i.e., we have 

L(e)=e K (L + eL 1 +e 2 L 2 + ...) (2.13) 

at least on a formal level, where k can be positive or even negative depending on the underlying two-scale 
ansatz. Recall that such an expansion is not available for the original microscopic system. 

Since we deal only with Hamiltonian structures on tangent bundles we benefit from Principle 12.31 In 
particular, the expansions 

H(e) = e K (H + dHi + e 2 H 2 + ...), tr(e) = e K (<x + ecr 1 + e 2 er 2 + ...) (2.14) 

are consistent with (|2.13|) . i.e., Hj is the Legendre transform of Lj, and we have er^ = ($~Li)*(T can . 

In the simplest case the reduced model is obtained by considering the leading order terms for L and 
(H, <x), and ignoring all terms that contribute to higher orders in s. However, depending on the two-scale 
ansatz the leading order system can be degenerate. For this reason we distinguish the following cases: 

Case A: The symplectic form <Xo is non-degenerate, i.e. there is no (Z, Z T ) G TTP with (Tq\z{Z t , ■) = 
0. 

Case B: (Tq is degenerate, but Lo depends on X T . 

Case C: The leading order Lagrangian Lo is quasi-stationary, this means independent of X T , and 
this yields Ho = — Lo and er = 0. 

Reduction in Case A Whenever we end up with Case A, the formal reduction provides a non- 
degenerate macroscopic Hamiltonian system and thus we have established already a (formal) micro- 
macro transition. In particular, the reduced Lagrangian reads L rcd = Lo and the associated Hamiltonian 
structure is given by (H red ,er red ) = (Ho,CTo), so that the macroscopic Hamiltonian equation on TP is 
given by 

* Ied \z(Z T , •) = dH rcd | z (-). (2.15) 

Recall, that we can neglect the pre-factor e K as it drops out in both the Lagrangian and Hamiltonian 
equations on TP. 

Since we have derived the reduced macroscopic structures by means of formal expansions with respect 
to e, we are confronted with the justification problem. More precisely, it is not obvious that solutions to 
(|2.15|) provide (approximate) solutions to the microscopic system. Of course, any curve r i— > Z(t) £ TP 
that solves cr\z(Z T , ■) = dH(-) and that obeys an expansion in powers of e, must satisfy (|2.15|) to leading 
order, but the existence of such an expansion for the solution Z(t) must be proven. This problem is 
very subtle and cannot be addressed here. Rigorous justification results for linear and some (weakly) 
nonlinear systems are given in |Mie08j . For a brief discussion of the difficulties that arise in the case 
of strong nonlincaritics we refer to t )3.2[ which shows that such an e-expansion can be valid only under 
additional assumptions concerning the initial data, the macroscopic time-interval under consideration 
and, finally, the regularity properties of the macroscopic equation. 
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Reduction in Case B In contrast to Case A, the Cases B and C allow for further reduction steps, which 
we explain next. We start with Case B and refer to the KdV reduction in £13.31 as a typical example. For 
simplicity we suppose that Lo depends linearly on X T , i.e., we assume that the momentum ITo = dx^o 
is a function of X but not of X T . As a consequence, the associated Hamiltonian structure lives on P, this 
means Ho is a function on P and a is a symplectic form on P. In fact, Ho = ((IIo(X), X T )) — "Lq(X, X T ) 
provides <9x T Ho = 0. This implies that the right hand side in 

£>n | (Jc .x T )(*> X T ) = d x Uo\x(X) 

is independent of both X T and X T , and due to (|2.2j) the form <Tq lives actually on P. Thus we end up with 
the following macroscopic model. The reduced Lagrangian L rcd = Lo lives on TP and has a consistent 
Hamiltonian structure on P given by H rcd = Ho|p and <r rcd = er |p. 

Reduction in Case C: Restriction to sub-spaces In some cases the leading order reduction turns 
out to be quasi-stationary, i.e., Lo does not depend on X T , and this implies Ho = — Lo and er = 0. 
Whenever this happens, we obtain a reduced macroscopic model as follows. Wc restrict the macroscopic 
configurations to 

P = {X £ P : = dxU{X ) = d x M (X ) } , 

and determine the reduced Lagrangian and Hamiltonian structures by restricting the next-leading order 
terms Lj and (H,;, tTj) to Pq. However, in general we shall expand additionally the solution X in powers 
of e, and this may produce correction terms in the expansions (|2.13|) and (12.141) . This problem will be 
discussed now, where for our purposes we can assume that Pq is a closed linear subspace of P. 
In order to identify suitable correction terms we start with the ansatz 

Z = (X, X T ) = Z a + eZ 1 = {X , X 0r ) + e{X u X u ), 

and study the Lagrangian L(e, Zq, Z\) = L(e, Zq + eZ-y) defined on TP with P = Pq x P. Exploiting 
dx T ^o = and 8x^o(Xq) = for all Xq £ Pq we find 

L(e, Z , Z t ) =e K (Lo(Xo + eX 1 )+eL 1 (Zo+£Z 1 )+e 2 L 2 (Z + eZ 1 ) + ...) 

= e K (Lo(Xo)+eL 1 (Zo)+e 2 (L 2 (Zo)+L 2 (Z , Z 1 )^j + ...), 

with first correction term 

L 2 (Z , Zx) = ±((d 2 x L (X )X u X x )) + ((dxU(Zo), XJ) + ((dxM&o), X lr )). 
In particular, any possible correction eZ\ effects L 2 but neither Lo nor Li. 

Case CI: Reduced model via Li If the next-leading order Lagrangian Li depends on X T , then the 
reduced Lagrangian is given by L rcd = Li|tp , and in this case we can ignore the correction term eZ\. 
Moreover, according to Lemma \2. 251 the corresponding Hamiltonian structure is given by (Hi, <Ti)|tp - 
An example for this case is the three-wave-interaction discussed in i)3.5l 

As before, the reduction to TPq is formal and must be justified rigorously. In the simplest case the space 
Pq is an invariant manifold for (H, a). This means that for all initial data chosen from TPq the solution 
t (— ► Z{t) to the original problem belongs to TPq for all times r > 0. In general, we expect that the 
restriction to TPq provides a reasonable reduced model if Pq is an approximate invariant manifold, so that 
solutions to (H red , <r red ) are approximate solutions to (H, a). For the justification in this case one has to 
prove that for all initial data chosen from TPq the real trajectory stays close to TPq (up to higher orders 
in e) at least for sufficiently small macroscopic times, see for instance jSWOO) lGM04|, IGM06) IGMS07| . 

Example 2.28. Let M be an integer, Q = L 2 ([0, M); dn) the Lebesgue space of all periodic functions 
on the interval [0, M], and let C(x, x t ) = IC(x t ) — V(x) be defined by 

IC(x t ) = \{x t , x t ), V(x) = -| (Arc, x), (x, x) 



M 

Jxxdn, 
(i 
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with discrete Laplacian (Ax) (77) = x{r) + 1) + x(ij — 1) — 2x(rj), so that the microscopic law of motion is 
the discrete wave equation xu = Ax. Moreover, consider the time scaling from Examplc l2.221 this means 
t = et, x = X , P = Q, y = ?/. Then, L obeys an (exact) expansion in powers of e 2 via 

L(e 2 , X, X T ) = h (X) + e 2 U(X T ), L (X) = -V(X), U(X T ) = K{X r ). 

The leading order Lagrangian and Hamiltonian equations read AX = and provide 

Pq = {X G P : X (y + 1) = X (y) }. 

Exploiting the next-leading order terms corresponding to Li we find 

L rcd = jjred = 1^ ^ ^rcd „ ^0 -1^ ^ 

so the macroscopic evolution is governed by Xo TT — 0. Moreover, the reduction is exact as both micro- 
scopic and reduced dynamics are equivalent for all initial data (X(0), X T (0)) € TPq. 

Case C2: Reduced model via L2 It may happen that even the next-leading Lagrangian Li|tp 
is quasi-stationary, i.e., Li(Zo) = Li(Xo) for all Zq = (Xq, Xq t ) 6 TPq. Then the general reduction 
procedure depends on the particular properties of Li . Here we restrict to the case we meet in i|3.4l (nlS 
equation), where the two-scale transformation implies 

Lilfb =-H!| Po =0. (2.16) 

For Li|p 7^ const we would restrict Xq further by imposing additionally dx {^i\p ) = 0. 

Notice that (|2.16|) does not necessarily imply dx^x (Xq) = G Lin(P, R) for all Xq G Po an d therefore 
we proceed as follows. Our strategy is to choose X\ in such a way that it is a stationary point of 
L 2 (Xo, X\). This means we seek X\ — X\(Xq) as solution to the affinc equation 

a|L o (Xo)X 1 +axL 1 (X o ) = 0. 

Provided this is possible, our reduced Lagrangian on TPq is given by 

L rod (Z ) = L 2 (Z ) +L 2 (X , X^Xq)), 

and since the term L 2 does not contribute to the fiber derivative JL rcd , one can show (similarly to Lemma 
|2T2"51) that 

H rcd (Z ) = m 2 (Z Q )+m 2 (X Ql Xx(Xo)), <x rcd = <t 2 \ TPo , 
where H 2 = — L 2 is the corresponding Hamiltonian structure on TPq. 

3 Two-scale reductions for the atomic chain 

The abstract framework developed in the previous section shall now be applied to the examples mentioned 
in the introduction. The microscopic system will be either the Fermi-Pasta-Ulam (FPU) chain 

Xj (t) = $i(x j+l (t) - Xj(t)) - SifoCt) - (3.1) 

or the Klein-Gordon (KG) chain 

Xj (t) = a(x j+ i(t) + Xj -i(t) - 2 Xj (t)) - & ( Xj (t)) (3.2) 

with harmonic constant a = $"(0) G R. Without loss of generality we always assume = $o(0) = 
^o(0) = ^i(0) = ^i(O), and restrict our considerations to infinite chains. In the case that the two-scale 
ansatz refers to small amplitudes, the linearized atomic chain 

Xj (t) = a(x j+ i(t) + Xj -i(t) - 2 Xj (t)) - $Z(0) Xj 
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becomes important. This linearized chain allows for propagating plane wave solutions e 1 ^^®^, provided 
that the frequency lo and the wave number 9 satisfy the dispersion relation 

uo 2 = n 2 (8) = 2a(l - cos(9) + <(0). (3.3) 

The atomic chain falls into the class of normal Hamiltonian systems, see (|2.1j) . with configuration space 
Qdiscr = ^ 2 (Z). The Lagrangian reads £discr(^, x) = ^discr(^) — Vdiscr(^) with kinetic and potential 
energy given by 

/Cdiscr(i) = \ ^2 ±2 j » V disc r(a;) = ( ^iOj+i - x i) + ®o(Xj) ) , (3.4) 

and Newton's equations (|1.13|) equal the Euler-Lagrange equations to £discr on TQdiscr- Moreover, the 
Hamiltonian is given by Tidiscr{x, x) = ^discr(i) + Vdiscr(a^), so that Newton's equations are equivalent to 

(o -iW fx\ (a x n disC r\ 

\1 J dt \xj \d ± H disc J ' 

which is a Hamiltonian ODE on TQdiscr with metric symplectic form, i.e., we have a = cr m ct in the sense 
of Example O 



3.1 The embedded atomic chain 

In order to derive effective models we start with a suitable embedding of the atomic chain. At first 
we replace the discrete lattice index j G Z by a continuous variable 77 G R. In addition, if the two- 
scale ansatz involves oscillatory microstructure, we consider k additional phase variables <p = {<fii, ••> 4>k), 
which are supposed to take values in the fc-dimensional torus T k . This embedding gives rise to the formal 
identification 

Xj(t) = x(t, j, 0), ±j(t) = x t (t, j, 0), 

where the instantaneous configuration x(t, •, •) is for each t a function in 77 and </>. 

Next, we identify the Lagrangian C of the embedded system. To this end, we replace all sums over j 
in (|3.4[) by integrals with respect to 77 and (j>. This yields 

C(x, x t ) = K(x t ) - V(x) (3.5) 

with 

K{xt) = y^dr^, V(x) = J ($i(V+ x) + * (a:))<W, (3.6) 

RxT k RxT k 

where is a discrete differential operator, sec Remark 13.11 below. Notice that the Euler-Lagrange 
equation for C, i.e. 

x tt = vi; $i(v£ a?) - a; = a?(*, 77, 0), 

is still fully equivalent to (an uncountable number of uncoupled copies of) Newton's equations (|1.13j) . 
However, the embedding gives rise to additional symmetries, and thus we gain new integrals of motion. 
In fact, the Lagrangian (|3.5[) is invariant under the continuous groups of space shifts 77 77 + 770 and 
phase shifts <j> (j) + <f>o, and Noether's theorem provides that 

I S pa.ce(x, x t ) = J x t x n drjdcj) G K, Xphase^, x t ) = J XtXcf, dr]d(j) G IR fc 

RxT k RxT k 

are conserved for any solution of the microscopic system. Recall that x v G M. and x^ G K fe denote the 
derivatives of x with respect to 77 and (j), respectively These conservation laws have no counterpart within 
the classical mechanics of mass points as they are a byproduct of the embedding. 
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Remark 3.1. For given S G R and 9 G T fc let 

( V M X )( 7 ?' <W = x (v + S, (f> + 0) - 1(77, (V^X^, 0) = ^ ^) ~ ~ S > <t> 
and A^e = — V^" e . These definitions imply 

7± _ V7T /T7± 



V^,_, = -V^, (V^J =-^l e , (A M )* = A M , 
where * denotes the adjoint operator with respect to the L 2 -inner product. 

3.2 From FPU to the wave equation 

Here we derive the quasi-linear wave equation from Newton's equation for FPU chains. Recall that the 
underlying two-scale ansatz is given by (] 1 . 14[) . and involves neither a microstructure nor a moving frame. 
For the embedded system this ansatz reads 

x(t, 77) = e~ 1 X(et, £77), (3.7) 

and Example 12. 191 provides Q = L 2 (R; d?7) and P = i 2 (R; dy) as well as the (lifted) inverse two-scale 
transformation 

T vel (e) : (X, X T )(y) ~ (x, x t )( V ) = (e^X, X T ) (e V ). (3.8) 
Lemma 3.2. The two-scale transformation (|3.8p yields L = K — V and E = K + V with 

K(e, X T ) = E^KoiXr), K (X T ) =J \X 2 T dy, V(e, X) = e^J ^ 1 (e- 1 W+X)dy. 

& R 

Moreover, we have H = E and cr = £ cr me t, where er mot is the metric symplectic form on TP, see 
Example POl 



Proof. All assertions are direct consequences of Q3.8j) and the abstract results from $'2.'2\ sec Theorem 
12.211 In particular, H = E holds, since the two-scale transformation does not involve a moving frame. □ 

Next we identify the leading order terms in the expansion with respect to e. Using formal Taylor 
expansion 

(s-^+X^y) = £ -\X{y + £) - X(y)) = X y (y) + \eX yy {y) + 0{s 2 ), 

we find V(£, X) = £- 1 V (X) + 0(1) with V (X) = j R $i(X y ) dy and conclude that 

L rod (X, X T ) = K (X T ) - V (X), H rod (X, X T ) = K (X T ) + V (X). 

Notice that Vo is defined only on i? 1 (IR; dy), which is dense in P. Finally, er rcd = <x met completes the 
leading order reduction and we end up with the following macroscopic model: 

Theorem 3.3. Both the formally reduced Layrangian and Hamiltonian equations are equivalent to 

X TT -& 1 {X v ) y = Q. (3.9) 

Remark 3.4. We claimed in the introduction that the Hamiltonian two-scale reduction is always related 
to the Hamiltonian structure on TP but fails if we use the canonical structure on T* P. In this example 
we clearly see the reason for this: The canonical momentum corresponding to L(e) is given by n = e~ 1 X T 
and, replacing X T by n, we find 

H(£, X, n) = «#x T L, X T )) -L = £ J \n 2 dy + e- 1 J ^(e^V+X) dy, 

R R 

the Hamiltonian on T*P. As long as we fix e > 0, the canonical equations X T = ell and n r = 
£~ 2 V~$ / 1 {e" 1 V+A) are fully equivalent to the Hamiltonian equations on TP. However, formal expan- 
sion of H with respect to e yields, to leading order £ _1 , the reduced Hamiltonian H red (A, n) = £~ 1 Yq(X) 
and the corresponding canonical equations A T = 0, H T = e~ 1 $' 1 (X y ) y are apparently different from the 
wave equation ()3.9|) . Of course, here we can overcome this problem by multiplying L with e, but this is 
not always possible as the KdV reduction in ^3.31 shows. 
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To conclude this section we discuss some aspects of Q3.9P which are closely related to the justification 
problem. In particular, it comes out that (|3.9[) can provide a reasonable macroscopic model for the FPU 
chain only under additional assumptions and this shows that the formal expansions from the reduction 
step truly need to be justified rigorously. We introduce new variables W = X T and R = X y and rewrite 
equation (|3.9p in the form 

d T R-d y W = 0, d T W-d y <f>[(R) = 0. (3.10) 

This is a first order system of macroscopic conservation laws with characteristic speeds A± = ±a/ &'{(R) 
and is called the p-system (with p = — see [DafOO] . These equations formally imply the conservation 
of energy i.e., any smooth solution to P3U|) satisfies d T E - d y (W$[(R)) = with E = \W 2 + $i(R). 

Now suppose that 3>i is concave or, more general, restrict R to the region of concavity of $i. In this 
case, the system p.lOp is elliptic and its initial value problem is ill-posed. For this reason the microscopic 
system behaves as follows: Even if we initialize the chain with data satisfying Xj(0) = e^ 1 X- m i{ej) and 
^i(O) = Wi n i(sj), where X- lri \ and Wj n i are infinitely smooth macroscopic functions, the atomic data 
will immediately start to oscillate on the microscopic scale, see |Her 05 DH07J for numerical simulations. 
Therefore, the two-scale ansatz cannot be satisfied for any t > and we conclude that any rigorous 
justification of (|3.9[) must exclude non-convex $i. 

Next suppose that <!>i is strictly convex, which provides the strict hyperbolicity of the p-system, and 
assume for simplicity that ^ is also strictly convex, so that all eigenvalues arc genuinely nonlinear. 
However, even in this case there exist limitations for the validity of (|3.10[) . In fact, it is well known that 
the nonlincarity of $i causes the following generic situation: Given smooth initial data for (|3.10p . there 
exists a critical time < To < oo at which the first macroscopic shock is formed. In particular, there exists 
a smooth solution for < t < tq, and for these times we can expect that (|3.7p provides an approximate 
solution of the microscopic system. However, for r > tq the macroscopic energy E is not conserved 
anymore and thus the p-system can not be related to the macroscopic dynamics of the chain, since the 
chain conserves the energy exactly. This phenomenon is usually called the shock problem and appears 
analogously in all zero dispersion limits, compare for instance the surveys in [Lax86, Lax91, LLV93J. For 
the FPU chain the macroscopic dynamics beyond the shock can be understood by Whitham's modulation 
theory with periodic travelling waves, see [FV99| IDHM061 IDHR06) IDH07j and |HFM81j [DM981 lEl"05] for 
the complete integrable Toda chain. Moreover, for harmonic lattices the macroscopic limit under the 
hyperbolic scaling can be established rigorously by means of weak convergence methods (cf. |Mie061 
IMie08j ). The transport of energies can be studied via Wigner-Husimi measures, see |Mie06j . 



3.3 From FPU to KdV 

To derive the KdV equation for FPU chains we rely on the two-scale ansatz 

x(t, n) = eX(e 3 t, en + ect) , (3.11) 

which is related to a moving frame with drift velocity c. Example 12.241 provides 

T vcl (e, t) : {X, X T )(y) ^ (x, x t )(n) = (eX, e 4 X T + e 2 cX y ) {en + ect) (3.12) 

with P and Q as in §3.21 The transformation T ve i is defined only on i? 1 (R; dj/)xL 2 (R; dy), a dense 
subset of TP, but in order to focus on the basic features of the reduction procedure we do not stress out 
this explicitly 

Lemma 3.5. Under the exact two-scale transformation (|3.12p the energies K, and V transform into their 
e -parametrized counterparts 

K(e, X, X T ) =e 3 J \{e 2 X T + cX y fdy, V(e, X) = e' 1 J $i (e\7+X)dy (3.13) 

K R 

and the matrix S corresponding to the symplectic form cr is given byT, = e 5 Si +e 7 S 2 with 




(3.14) 
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Moreover, due to the time dependence of the two-scale transformation, H differs from E and satisfies 
U = E + I with 

I(e, X, X r ) = -e 3 c J (e 2 X T + cX y )X y dy. (3.15) 

k 

Proof. For the proof of (|3.13j) and (|3.15|) we insert the ansatz (|3.1ip into the definitions of V, K, and I, 
cf. Formula (|3.6[) and Example 12.241 and replace e 3 t + ecq by y in the arising integrals. Moreover, the 
identity H = E + I is provided by Theorems 12.161 and 12.211 Finally, the linear two-scale transformation 
T vol (e, t) can be identified with S o T, where S : TP — > TQ is given by (S Z){rf) = Z(en + ect) and T 
abbreviates the operator-valued matrix 

£ 
e 2 cd y e 4 

Due to Lemma |2~21)1 and Remark |2~2"71 we find £ = ^ g2 T* S mct T, which yields ([3~UT| after a short 
computation. □ 



Leading order reduction At first we expand the various energies with respect to e up to 0[e 6 ). To 

2 x 6 



this end, we define Vi = <J>^ (0) so that the Taylor polynomial of $i reads $i(x) = ^x 2 + ^x 3 + h.o.t. 



Lemma 3.6. The transformed energies K, V and I satisfy 

K = £ 3 Ko + e 5 K 1 + C'(e 6 ), V = e 3 V + e 5 ¥i + C(e 6 ), 1 = e\ + e% + 0(e 6 ), 



whe 



K (X) = ^ y\ 2 dy, K X (X, X T ) = c J X T X y dy, 

K R 

V (X) = fjx 2 dy, VipT) = -1 / dy + f | X 3 dy, 

R R 8 

I (X) = -c 2 /" X 2 dy, % (X, X T ) = -cJ X T X y dy. 



Proof. The expansions for K and I follow immediately from Lemma l3.5l To prove the remaining assertions 
we start with eV+X = e 2 X y + £ Z \X yy + e A \X yyy + C(e 5 ) and obtain 

$!(eV+X) = f ( e V+X) 2 + f (eV+X) 3 + 0(|eV+X| 4 ) 

= -^(e 4 ^ + e 5 X y X TO + £ 6 |X 2 y + e 6 \X y X yyy ) + £ 6} fX y + 0(e 7 ). 

We insert this expression into the formula for V(e, X) and due to L X y X yy dy = and J K X y X yyy dy = 
— J R X yy dy, we obtain the asserted expansion for V. □ 

In the next step we can read-off the leading order terms of L = K — V, H = K + V + I and £. However, 
the order of e at which we find the reduced Lagrangian and Hamiltonian structures depends on the choice 
of the moving-frame speed c. Let us start with the case c 2 ^ vi ■ Under this assumption the leading order 
terms correspond to e 3 . More precisely, we obtain er red = and 

U cd (X) = -W cd (X) = K Q (X) - V (X) = \{c 2 - v 2 ) J X 2 dy. 

In particular, both the reduced Lagrangian and Hamiltonian equations turn out to be equivalent to 
X yy = and have no non-trivial solutions at all. Thus, we assume 

c 2 =v 2 = $'/(0), (3.16) 

i.e., the moving-frame speed equals the sound velocity of the linearized FPU chain. In this case we find 
Ko = Vo = — ^lo and this leads to cancelations in L and H. Consequently, the leading order terms in the 
Lagrangian and Hamiltonian structure correspond to e 5 and we end up with the following macroscopic 
model: 
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Theorem 3.7. With (|3.16[) the reduced Lagrangian and Hamiltonian structures are given by 

h Iod (X,X T )=K 1 (X,X T )-Y 1 {X), W Icd (X) = VipO, S red = Si. (3.17) 
In particular, both the reduced Lagrangian and Hamiltonian equations are equivalent to 

2 CX T y — J2 V2 Xyyyy ~ V^XyXyy = 0, (3.18) 

which is a KdV equation for X y . 

Proof. The identities (|3.17j) can be read-off from Lemma I3H1 and ()3.18p follows by a direct calculation. □ 

The KdV reduction with (|3.16|) is an example for Case B from Section ^2.31 i.e., the reduced Hamil- 
tonian structure lives on P and not on TP. Moreover, the term Iq, which produces the cancelation in Lo 
and Ho, equals up to the sign the macroscopic integral of motion I red (X) = J X 2 dy, associated to the 

R 

invariance under shift in the y-direction. 

Remark 3.8. As before, the formal two-scale reduction relies on the Hamiltonian structure on TP but 
fails if we use the canonical structure on T* P. Even worse, here we cannot overcome this problem by a 
simple rcscaling of L. To understand this, we consider the rescaled Lagrangian (for c 2 = V2) 

L(e, X, X T ) = £- 5 L(e, X, X T ) = K X {X, X T ) + e 2 K 2 (X T ) + V(e, X), 

where 

K 2 (X T ) = c J X r X y dy, V(e, X) = £- 5 (V(e, X) - e 3 Y Q (X)) » Yi{X) = (9(1). 

R 

The canonical momenta are given by n = cX y + e 2 X T and computing the associated Hamiltonian on 
T*P wc find 

t(e, X, n) = e-' 2 t (X, n) + V(e, X), %(X, n) = J i(n 2 - c 2 X 2 ) Ay. 

E 

In particular, the canonical equations corresponding to the leading order Hamiltonian Ho do not equal 

Finally, wc mention that rigorous justification results for the KdV reduction can be found in |SW00( 
IFP99| . However, these results do not use the reduced Lagrangian or Hamiltonian structures, but work 
on the equation of motion directly. 

3.4 From KG to nlS 

We start with the two-scale ansatz (|1.15j) for a modulated pulse in the KG chain (|3.2|) with a = 1 and aim 
to show that the complex amplitude A satisfies the nlS equation. Recall that the plane waves appearing 
in (|1.15[) model a microstructurc of harmonic oscillations, and thus wc can regard the nlS equation as a 
macroscopic modulation equation. 

In contrast to the previous examples, here the two-scale ansatz does not provide immediately an exact 
two-scale transformation, but we can setup the problem as follows: Wc embed the discrete lattice Z into 
the cylinder IxT 1 and identify each microscopic configuration with a function x 6 Q = L^KxT 1 ; dr?d0) 
depending on the microscopic continuous space variable n and a periodic phase variable </> G T 1 = [0, 2ir]. 
Moreover, in accordance to the scaling, we choose P = L 2 (Mxr 1 ; dydcf) and make the two-scale ansatz 

x(t, 77, 4>) = eX(e 2 t, erj-ect, <f> + uit + 6r{) , (3.19) 

which gives rise to the inverse two-scale transformation 

(T vcl {e, t){X, V r )) (r/, <j>) = (eX, e 3 X T - e 2 cX y + ecuX^)^ - set, <p + uot + 6n). (3.20) 

In this section we show that this transformation implies both the particular form of the microstructurc 
and the nlS equation. 
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Remark 3.9. From (|3. 19[) we read-off the identity T con (e, t) = o A4 CO n(—t) ° § C on(£), where 

(E> con (e)X)(n, 0) = eX(en, <f>) denotes the inverse scaling transformation. Moreover, (M CO n(t) x)(r], </>) = 
x(j] + ct, 4> — cot) is a moving frame transformation with associated integral of motion I(x, Xt) = J RxTl Xt^cx^ — tox^)dr]d(j) 
and (T con x)(r], 4>) = x(rj, (f> — drf) corresponds to a weak symmetry transformation. 

Lemma 3.10. Under (|3.20p the transformed energies K ; V and I take the form 
K(e, X, X T ) = e- 1 J \{e 3 X T - e 2 cX y + ewX^f dyd(f>, 

RxT 1 



V(e, X) = e- 1 J {-e 2 \XA e ^X + ^{eX))dyd^ 

RxT 1 

I(e, X, X T ) = e- 1 J (e 3 X r -e 2 cX y + eujX^)(e 2 cX y ~ecjX^dyd^ 



IxT 1 



and we have L = K — V, E = K + V and H = E + I. Moreover, the matrix S corresponding to the 
symplectic form a satisfies X = e 3 S2 + £ 4 ^3 + e 5 S4, with 

_ f-2tud^ 0\ (2cd y 0\ _ fO -1 



o o / ' J V o o / ' 4 V i o 

and is non- degenerate due to S4. 

Proof. At first we study the time dependent transformation M. c(m {t) o % on : Q — > Q, see Remark 13. 9[ 
and write a; = T con x and i = .M con (i):r. According to Theorem 12. 51 the transformation T con transforms 
("H, a) into (7i, a) with TC = TLo%~^ and 17 = (T^) o. Then we apply the moving frame transformation 
M ve i(t) and Theorem 12. 161 provides the Hamiltonian structure (H, a) with a = (A4 ve \(—t)) a and H = 
(£ +J) o M ve i{-t) with £ = H and 



X(x, it) = / Xt(cx v — Lox^)drjd(f>. (3-21) 

RxT 1 

Moreover, exploiting Theorem 1 2 . 2 1 1 for the scaling transformation S con (e) = S^ n (e), we find 

H = (£ + 1) o T-J o M ve i(-i) o § vol ( £ ), <t = e 2 (T-J o M ve i(-t) o § vc ,(e))V 

with I = lo 7^oi. According to (|3.6[) and (|3.21[) . the microscopic energies are given by IC(xt) = 
\ Lxt 1 x ? dr ? d< ^ and 

V(x) = y ^ — \xt\\^x + &o(%)j drjdc/), X(x, x t ) = J Xt(cx v — c&x$ — uix^drjd^, 

RxT 1 RxT 1 



where the discrete operators V and A are defined in Remark 13.11 The expressions for K, V, L, E 
and I now follow by inserting (|3.19[) into the formulas for JC, V and I. For the computation of X we 
identify the linear two-scale transformation T ve i(e, t) with SoT, where S : TP — > TQ is given by 
(SZ)(rj, 4>) = Z(erj — set, <f> + uit + On) and T abbreviates the operator- valued matrix 

e 
— e 2 cd y + eujdcj, e 3 

with components in Lin(P, P). Finally, Remark [2~271 yields £ = ^ ^T'S mct T and this implies the 
desired result. □ 
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Leading order reduction Next we derive the formal expansions with respect to e. To this end we 
introduce the constants v\ = $g^(0) and find, due to vq = V\ = 0, 

$ (eX) = e 2 f X 2 + e 3 f X 3 + e 4 ^ X 4 + 0(e 5 ) . (3.22) 

Lemma 3.11. The transformed energies satisfy 

1. I = £l + e 2 Ii + £ 3 I 2 + C(e 4 ) with I a (X) = -lu 2 j X% dyd<j>, 

RxT 1 

\{X) = 2ujcJ XyX^dydcj), h(X, X r ) = -c 2 J X 2 dyd<b -w J X T X^dyd& 

RxT 1 RxT 1 RxT 1 

2. K = eK + e 2 Ki +e 3 K 2 + C(e 4 ) with Kq(X) = -\ I (X), K t (X) = - \ Ii(X) and 

K 2 (X, X t ) = -\1 2 (X) + \uj J X T X^dyd^ 

RxT 1 

3. V = e¥ + e 2 V : + e 3 V 2 + 0(e 4 ) with 

MX) = ~\ f XAo^Xdydcji, + f /" X 2 dyd0, 

KxT 1 RxT 1 
RxT 1 RxT 1 

V 2 (X) = -±Jx(y+ e -Vo te + 2Id)X vv dyd<l> + f 4 Jx 4 dydcj ) . 

RxT 1 RxT 1 

Proof. The expansions for K and I follow directly from Lemma 13.101 Moreover, Taylor expansion with 
respect to e yields 

A e , e X = A , e X + e (v+ e + ^,e) X v + ^ 5 (v o + e - V„ ifl + 2Id) X w + 0(e 3 ) , 

and this gives rise to the first kind of integrals in (|3.23j) . Finally, inserting ()3.22p into L xT i &o(sX) drjdcj) 
completes the proof. □ 

According to Lemma 13.111 the leading order Lagrangian and Hamiltonian equations are given by 

~uj 2 X u + A , e X - v 2 X = 0, (3.24) 

and, using Fourier transform with respect to <j), wc conclude that this equation has nontrivial solutions 
if and only if lo and 6 satisfy m 2 iv 2 = Q 2 (m8) for some integer m, where Q is the dispersion relation for 
the linearized chain with a = 1, that is 

ti 2 {0) = v 2 + 2{l-cos6). (3.25) 

In what follows we always assume to 2 = fl(8) 2 as well as the non-resonance condition 

m 2 uj 2 ^ n(m8) 2 for m G Z \ {-1, 1}, (3.26) 

which imply that the solution space to (|3.24[) in L 2 (T 1 ; dcjA is spanned by cos <f> and sin (j>. 

Theorem 3.12. Suppose uj 2 = Vl 2 (6) and (|3.26p . Then, the leading order Lagrangian and Hamiltonian 
equations (|3.24[) are quasi- stationary (corresponding to So = 0) and have solutions 

X (y, cj>) = ^^(b^v) cos (<f>)+B 2 (y) sin ((/>)) (3.27) 

with Bi, B 2 G L 2 (R;dy) arbitrary. Moreover, (pHTj) implies = h Q (X a ) = U (X ). 
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Proof. All results follow from Lemma [3.111 □ 
Introducing a complex valued amplitude A by = B± — IB2, Equation (|3.27|) transforms into 

X (y,4>)=2R e {A{y)e^), (3.28) 
and is hence equivalent the original two-scale ansatz (| 1 . 1 5[) . 

Elimination of the microstructure The leading order reduction determines the structure of the 
microscopic oscillations together with the dispersion relation. As discussed in Case C2 of W2.2>\ this allows 
for a further reduction step that yields the macroscopic modulation equation for the amplitudes B\ and 
B2, or, cquivalently, for the complex- valued amplitude A. Let Pq be the L 2 -spacc of complex- valued 
functions depending on y, i.e., 

P = {(B u B 2 ) e L 2 (R;dy)xL 2 (R;dy)} = {A e L 2 (R; C)} , 

which can be viewed as a closed and proper subset of P due to (|3.27|) . By construction, each element 
of Pq satisfies the leading order equations exactly, and thus we can use the next-leading order terms in 
order to derive the effective macroscopic dynamics on TP . 

Below we choose the moving frame speed c appropriately, and this yields Li |p = as well as Hi \p a = 
due to cancelations. Consequently, the reduced structures are related to L2, and hence we must take care 
of the correction terms L2 and H2 coming from the ansatz X = Xq + eX\ , see Case C2 in H2.'Si 

Lemma 3.13. With X = Xq + eX\ we have 

L 2 (A , X x ) = Lo(Xi) + J X 1 {2wcX 0v<j> + (v+ e A ,, + V^AqJ - f X 2 ) dyd^ 

ExT 1 

and H2(Ao, X±) = — L,2(Ao, Ai). Moreover, all corrections to the symplectic structure are of order e 3 
and do not contribute to Yi2- 

Proof. The correction terms for I, K, and V can be read-off from Lemma 13.111 More precisely, we find 
1. h(X + eX l )= UX ) +%{X , Xx) with I (A , Ai) = and 



\{X Q , Ax) = 2u 2 J X,X ^ dyd0, 

RxT 1 

I 2 (A , Ai) = I (Ai) - 4wc [ AiAo^dj/d^, 



IxT 1 



2. Kj ( A + £ Ai) = K,(A ) + K*(A , Ai) with K (A , Ai) = and 

Ki(X , *i) = -5li(A , Ax), £ 2 (A , A x ) = -±I 2 (A , Ax), 

3. V,(A + eX l )= Vi(Ao) + V 4 (A , A x ) with V (A , Ax) = and 

VxCAo, Ax) = - J X 1 A 0t eX dydcj ) + v2 J X^dydfi, 

RxT 1 IxT 1 

V 2 (A , A:) = V (A!) - I Xi (^loXuy + V oe A J dydep + f j X\X 2 dydep. 



IxT 1 RxT 1 



Finally, due to L = K — V and H = K + V + I all assertions are direct consequences of these identities. □ 
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Lemma 3.14. If the moving frame speed c is given by 

cuj = -0'(9)0(9) = - sin 9 (3.29) 

then Li|p = — Hi|p = 0. Otherwise the Lagrangian and Hamiltonian equations to Li and Hi have no 
non-trivial solution at all. 

Proof. A direct calculation shows 

Ii(X ) = -2Ki(X ) = 2wc / (B ly B 2 -B 1 B 2y )dy = Aujc J B ly B 2 dy, 

R R 

and using 

X (yoeXy + ^oe^-yj = 2 sin #(-Bi cos 4> + B 2 sin 0) {—B\ y sin (f> + B 2y cos 0) 
as well as = / (£?i cos (j> + B 2 sin 0) 3 d</> we find 



Vi(X o ) = -sin0 /" (B 1 B 2y - B ly B 2 ) dj/ = 2sin0 / B ly B 2 dy, 

R R 

so that Li|p = — Hi|p = for (|3.29[) . Finally, for other values of c the Lagrangian equations for Li|p 
equal B\ y — B 2y — Q. □ 

Condition (|3.29jl implies that the moving frame moves with the negative group velocity associated 
to the dispersion relation (|3.25|) (the negative sign appears since our phase definition is <fi — Lot + 9rj) . 
Compare this with the case c 2 ^ (0) from 



Next we prove that the non-resonance condition (|3.26p provides the higher order correction X\ in depen- 
dence of the first order solution Xq. 

Lemma 3.15. Suppose uj 2 = 2 (9), cuj = —0' (9)0(6), and the non-resonance condition (|3.26j) . and let 
Xq be fixed. Then, each solution Xi to the equation 

d Xl U(X , X 1 ) = 

satisfies X\ — X\(Xq) £ Pq, where the special solution Xi(Xq) is given in the proof. Moreover, for each 
Xq G Pq we have 



L 

with 



2 (x , X 1 (X )+Xq) = -H 2 (X , X 1 (X ) + Xq) =-Y 2 (X q ) 



V 2 (X ) = cj (B 2 + B 2 2 ) 2 dyd4>, G = || 

RxT 1 

Proof. The choice of c implies 



1 



4(4w 2 - 2 (29)) 2v 2 



U(Xq, X x ) = U(Xi) - f / X,X 2 dyd^, 



2 

IxT 1 



and hence the equation for X\ becomes 

oj 2 X 14>4> - A , e X l + v 2 X, = -^X 2 = -f f ^1 + Sl^L CO s 20+^1 sin 2 A _ (3 . 30) 
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This equation can be solved explicitly by Fourier transform with respect to <f> and noting that the operator 
w 2 (?00 — Ao, e + v 2 is symmetric with kernel orthogonal to X§. Some elementary analysis shows that each 
solution (|3.30|) can be written as X\ = Xi(Xq) + Xq, where Xq G Pq and 

Xi(Xo) = CiXq + C 2 , C\ — —- — 5 — 3 n9/ofl ,, , C2 = — [ C\ + — — 



2(4w 2 -ft 2 (26>))' ' \ 1 2v 2 J 2tt ' 
Multiplying (|3.30p by X\ and integrating over RxT 1 gives 

2L (x 1 (Xo)+Xo) = f J (x 1 (X )+X )x$dyd<f> = f j X^X^dyd^ 

IxT 1 RxT 1 

for all Xq G Pq, and hence we find 

t 2 (x Q , X 1 (Xo)+X ) =U(X , X X (X )) = -f J (C\X 2 + C 2 )X 2 dydcj) 

RxT 1 

which implies the desired result. □ 

Finally, we combine all results and obtain the macroscopic model on TPq. 

Theorem 3.16. Under the assumptions made in Lemma \3.15\ the reduced Lagrangian and Hamiltonian 
are given by 



with 



L rod (A Ar) = K red (A A-) - V rcd (A), H red (A, A T ) = V rod (A), 
K Icd (A, A T ) = ±2ttuj J (AA T - A T A) dy, 

R 

¥ red (A) = 27rp! / \A y \ 2 dy + 2np 2 [ \A\ 4 dy, 



where the constants p\ and p 2 are given in (|3.34[1 . and A = {B\ — ±B 2 ) / (2y / 7r) is the complex-valued 
amplitude. Moreover, in terms of (A, A T ) the reduced symplectic matrix S Icd is given by 



£ rcd =4™i(J °J, (3.31) 

and both the reduced Lagrangian and Hamiltonian equations are equivalent to 

i2 ujA t = piAyy — 2p 2 \ A\ 2 A, (3.32) 
which is a nonlinear Schrddinger equation. 

Proof. Using the results from Lemma 13.111 and Lemma 13.131 we end up with 

L(X , X x ) = e 3 (K 2 (X a ) - Y 2 (X ) +l 2 (X Q , X 1 )) + 0(e 4 ), 

W{X Q , X^ = e 3 (K 2 (X Q ) + Y 2 (X Q ) +1 2 (X Q ) + U 2 (X Q , X 1 )) + 0(e 4 ), 

where we have used that Lj(Jfo) = Mi(Xo) = for i = 0, 1 and Xq G Pq, compare Theorem 13.121 and 
Lemma T3. 141 Moreover, due to Lemma T3. 151 we can eliminate X\, and this yields 

L rod (X ) = e 3 (K 2 (X ) - Y 2 (X Q ) - %(X )), 
W cd (X ) = e 3 (K 2 (X ) + Y 2 (X ) +h(X ) + V 2 (X )), 
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compare Case C2 in i)2.3l Inserting (|3.27[) into the formulas from Lemma [3.111 gives 

h(X ) = -c 2 J ((£? ly ) 2 + (B 2 ,) 2 ) dy-uj (B lT B 2 -B 1 B 2T )dy, 

R E 

K 2 {X ) = \c 2 J ({B ly ) 2 + (B 2y j 2 ^dy + u J (B lr B 2 - B x B 2t ) dy, 

R R 

V 2 (X o ) = ±cos0 1 ([B ly f + (B 2y ) 2 ) dy+^J (Bf + B*) 2 dy. 

R R 

By construction, i?i and B 2 satisfy B\ = y / 7r(A + A), B 2 = iy / 7r(A — A), and thus we find B\ T B 2 
BiB 2t = ±2tt (AA t - ^1 T 34) as well as 



{B ly ) 2 + (B 2y f = 4mA y A y = 4ir\A y \ 2 , (Bi* + B 2 2 ) = \Qn 2 (A A) = 16tt 2 \A\ a 



We define 



and 



v 4 v 3 



p 1 :=n(6)n"(6)=cos0-c 2 , p 2 ;=^- 



4 2v 2 4(4w 2 - n 2 (26)) 



(3.34) 



K rcd := ljJ (B\ T B 2 - B x B 2t ) dy, 

V rcd :=f / ((5g 2 + (5 2 ,) 2 )dy+il / (5f + 5l) 2 d y , 

R R 

and this implies the formulas for L rcd and H rcd . To compute S rcd , recall X = £ 3 X 2 +C(e 4 ) independent 
of Xi , and notice that the ansatz (|3.27|) can be written as 



X 

X T 



= J- 



(B X \ 

B 2 

Bu 
\B 2T J 



, T = 



1 / cos <j> sin 
\ cos0 sin( 



with T : TP Q -> TP. The adjoint operator T' :TP^ TP Q reads 



X sin 



X T cos 



and with respect to the variables £?i, B 2 , B\ T , B 2r we find 



X T sin 



ircd 



V /2^d 



T = 2lu 



(° 


-i 





°\ 


1 























V° 








0/ 



which implies (|3.3ip . From this and (|3.33p we conclude that both the reduced Lagrangian and Hamiltonian 
equations on Pq read 

~2ujB 2t = - P iB lyy + ± P2 {B\ + B 2 2 )B X , 
+2loB 1t = - P iB 2yy + ±pi{B\ + B 2 )B 2 

and rewriting this in terms of A we find (|3.32p . □ 
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As before, Theorem 13.161 concerns a reduced macroscopic model on Pq that is obtained by means 
of formal expansions. In particular, it is not obvious that the nlS equation (|3.32p combined with the 
modulation ansatz (|3.28|) yields approximate solutions for the KG chain. However, the careful residual 
analysis from |GM04| IGM06] provides rigorous justification results, and thus we can regard Pq as an 
approximate invariant manifold. 

Remark 3.17. Like for the KdV example, the terms lo and Ii, which cause the cancelations in Lo and 
Li, provide macroscopic conservation laws. In fact, with some calculations we find 

Io~y \A\ 2 dy and h ~ / lm(A y A) dy, 

B R 

which equal the macroscopic integrals of motion associated with the symmetries under phase shifts A i— > 
e ls A, and shifts in the y-direction, respectively. 



3.5 Three-wave-interaction for the KG chain 

Here we discuss the interaction of three pulses in the KG chain, see (|1.16[) . More precisely, we consider 
three pulses p\, p 2 , and p^, and aim to understand how the resulting microstructure is modulated on the 
hyperbolic scale for space and time. 

Pulses in the KG chain We briefly summarize some aspects of pulses, and refer to G M041 rGia08[ 
IGMS07) for more details. A plane wave is a solution to the linearized chain 

x 3 (t) = Ae i ^ t+e ^ + c.c. = Ae +i ^ t+e ri + fa-H^W , j G Z, 

with complex amplitude A, frequency lo, and wave number 9. Notice that (—0, —lo) gives the same pulse 
as (9, lo), whereas (9, —lo) is the pulse that travels in opposite direction. Obviously, each plane wave 
must satisfy the dispersion relation 

lo 2 = Q 2 (9) =v 2 + 2a(l - cos 9) with v 2 = $ '(0). 

Since the amplitude A can always be chosen arbitrarily, we can identify each plane wave with a point in 

V = {(9, lo) : lo 2 = n 2 {9)} C T 1 x I. 

In what follows we assume the stability condition 

minUa + v 2 , v 2 } = min Vl 2 {9) > 0, (3.35) 

so that each single plane wave is a stable solution to the linearized chain. 

A simple pulse is a modulation of a plane wave by a slowly varying amplitude 

xf\t) = eA k (et,ej)e 1 ^ kt+e ^ +c.c. 

On the hyperbolic scale r = et and y = ej a pulse will simply travel with group velocity c k = fi'(#fc). 
However, if different pulses associated with p k G V meet each other they interact in case their frequencies 
and wave vectors are in resonance. Three plane waves p\,p 2 ,p^ € V are called in three-wave resonance if 
there exists a choice of three signs m, G { — 1, +1} such that m,\p\ + ra-ipi + m^ps = (0, 0) 6 T 1 x K. By 
using complex conjugates and replacing p k by — p k if necessary, we can always assume that 

Pi+P2+ P3 = 0, ,c, { Wi + U}2+ul3 = Q £ ^ (3.36) 
This resonance condition arises naturally as it is equivalent to the cancelation of oscillations via 
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and guarantees that the product of two pulses contains oscillatory terms that appear also in the third 
pulse. 

Of course, the KG chain allows for resonances between more than three pulses, but in our context 
these can be ignored for the following reason. According to p,16|) . the pulse amplitudes scale with s, 
so that three-pulse resonances, which are related to quadratic products such as x\x 2 , correspond to the 
power e 2 . Interactions of more than three pulses, however, contribute to order ©(e 3 ), and are thus not 
relevant on the hyperbolic scale. 

However, to make the presentation as simple as possible we now assume that the three plane waves 
Pi,p 2 , and P3 are chosen such that except for (|3.36j) there are no further resonances. More precisely, we 
define 

z = {{k l7 k 2 ) g i? : (uj 1 k 1 +cj 2 k 2 ) 2 = n 2 (e 1 k 1 +e 2 k2)}, 

and make the following assumption. 

Assumption 3.18. The vectors pi,p 2 G V are chosen such that 

£ = {(1,0), (-1,0), (0,1), (0,-1), (1,1), (-1,-1)}. 

Obviously, we always have (±1, 0) G Z by p± G V and similarly (0, ±1) G Z by p 2 G V . If additionally 
P3 G V satisfies the three- wave resonance condition (|3.36[) . then ±(1, 1) also lies in Z. Thus, Assumption 
13.181 already implies (|3.36p and additionally excludes any further resonances involving these three plane 
waves. 

Remark 3.19. (i) According to |Gia08j . the resonance condition (|3.36|) is equivalent to 

a 2 ^1^2(^1+^2) + (2afiifi 2 +8)y / (a(ii+5)(ct[i 2 +8) + 5a(^i^i 2 +^ii+^ 2 ) + j5 2 = 

with fii = (1 — cos#i)/2 and 5 = v 2 /4. Hence, for a > (attractive nearest-neighbour interactions) 
the resonance condition cannot be satisfied as the stability condition ()3.35(1 implies 5 > 0. However, 
for a G {—\v 2i —jqV 2 ) (repulsive case) the stability condition is still satisfied, but now there exists a 
one-parameter family of solutions (/ii , fi 2 ) . 

(ii) In general, it is not easy to check the non-resonance conditions implied by Assumption 13.181 i.e., to 
prove that no further plane waves are contained in Z. The mapping Z 2 3 k (k-(6\, 62), k-{oJ\, CO2)) G 
T'xK may have a dense image and hence comes close to the set V very often, giving rise to a small 
divisor problem. However, by varying also a and v 2 , it is possible to choose 6*1,6*2 as rational multiples 
of 7r and to make loi/uj 2 rational as well. Then, the image of the above mapping hits every bounded set 
in finitely many points. Then, Assumption 13.181 appears very reasonable. 

(iii) In Remark 13.251 below we provide a weaker variant of Assumption 13.181 

Invertible two-scale ansatz The resonance and non-resonance conditions imposed by Assumption 
I3.18l implv that there exist exactly two independent phase variables. Therefore, concerning the embedding 
of the microscopic system, it is necessary and sufficient to introduce a two-dimensional phase variable 
= (0i,0 2 )GT 2 , i.e., 

Q = L 2 (RxT 2 ; dnd<j)), P = L 2 (RxT 2 ; dydcf). 
Similarly to the nlS example we start with the invertible two-scale ansatz 

x(t, -q, cj>) = eX(et, sr], 4> + u>t + 07]), (3.37) 
so that the corresponding inverse two-scale transformation T vc i(e, t) : TP — > TQ reads 

(T ve i(e, t)(X, X T ))(r7, <f>) = (eX, e 2 X T + eu> ■ A^) (en, <f> + ujt + 6rj) (3.38) 

with 6 = (0i, 2 ), oj = (u>i, u> 2 ), and = (d^, d<f, 2 ). 

Remark 3.20. The ansatz (pHTf provides T con (e, t) = T~* o M coa {-t) o § 

con(^) ■ P — > Q with inverse 

scaling transformation (S CO n(e)A)(?7, cf>) — eX(er\, </>), weak symmetry transformation (T ccm x)(n, </>) = 
x(r], 4> — 0if)i and moving frame transformation (Ai CO n(t) x)(rj, <f) = x(rj, 4> — cot) associated to the inte- 
gral of motion X(x, x t ) = —u ■ X phaso (x, x t ) = - J RxT2 x t (u> ■ x^)dnd(f>. 
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Leading order reduction We start with the computation of the transformed structures. 

Lemma 3.21. The transformation (|3.38[) provides L = K — V. E = K + V, and H = E + I. as well as 
the following expansions: 

1. I = el + e 2 Ii +C(e 3 ) with 

MX) = - j \u ■ X^f dyd^ h(X,X T ) = -J X T (wX^dyd4>, 

RxT 2 RxT 2 

2. K = eKq + e 2 Ki + C(e 3 ) with Kq = -§ I and K x = 

3. V = eY + s 2 Y 1 +0(e 3 ) with 

Y (X) = -§ J XAo.eXdydcj) + f / X 2 dyd0, 

RxT 2 RxT 2 

MX) = -f f x(v+ e + V- )x v dyd<f> + f[x 3 dydcf>, 

RxT 2 RxT 2 

where a = &{(0), v 2 = $(,'(0), and w 3 = $d"( )- 
Moreover, the matrix X corresponding to <x obeys the exact expansion 



o oy ' \1 

so that X is non- degenerate due to £2. 

Proof. Analogously to the proof of Lemma 13.101 we find the equations for L and H along with 
l(e, I,I T )=IoT vc ,(£,t)= £ - 1 J (e 2 X T + ecj-X^)(~euj-X lP )dyd^ 

RxT 2 

K(e, X, X T ) =/CoT vol (e, t) = e' 1 J \{e 2 X T + ecu ■ d^X) 2 dydcp, 



RxT 2 



W(e,X)=VoT vel (e,t)=e- 1 I (-e 2 f XA £ . 9 X + & (eX)) dyd<b. 

RxT 2 



Moreover, the expansions with respect to e follow from direct calculations, and using 

e 0' 
j ■ dj, 1 

the matrix £ can be calculated by means of Remark 12.271 □ 



As a consequence of Lemma [3.211 we obtain So = and Lo(AT) = — Mo(X), and the leading order 
equation 

(w ■ d^fX - aA . e X + v 2 X = 0. 

is again quasi-stationary. Applying Fourier transformation with respect to cj>, a general function X has 
the form X(y, <fi) = X^-ez 2 Fk(y) elk an( i solves the above equation if and only if Fk = for all k £ Z. 

Lemma 3.22. Under Assumption \3.1~S\ the leading order Lagrangian and Hamiltonian equations are 
quasi- stationary, and all solutions are given by 

3 

X (y, <t>) = J2 My)e i4 "> + ex., (3.39) 

n=l 

with 03 = — </>i — 4>2 and arbitrary A n £ i 2 (R; C), n = 1,2, 3. Moreover, we have Lo(Xo) = W${Xo) = 
for all X with ([539"]) . 
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Elimination of the microstructure As outlined in £12.31 we derive the reduced macroscopic model 
by restricting the next-leading order terms to the space 

Po = {X Q eP : dxU(Xa) = d Y H (A ) = 0} £* {A = (A U A 2 ,A 3 ) e (L 2 (R; C)) 3 }. 

Notice that, in contrast to the nlS example from ^3.41 here Li|tp and Hi|tp do not vanish, and provide 
the reduced Lagrangian and Hamiltonian. In particular, we need not care for the correction terms coming 
from X = X + eX x . 

Theorem 3.23. Under Assumption lS. 18\ the reduced Lagrangian and Hamiltonian are given by 
U od (A, Ar) = K rcd (A, A?) - V rcd (A), W eA {A, A T ) = V rod (A), 

with 

3 

W cd (A, Ar) = i £ u n J A n A nr dy + c.c. 



n=l 



Y rcd (A) =v 3 J AiA 2 A?, Ay + i ^ dy + cc, 

R 71=1 K 

where oj n uj' n = fl(9 n )fl'(9 n ). Moreover, 



-2i 



n cp 

0/ ' 



ft 



<u x 

L0 2 

to 3 . 



is the reduced symplectic matrix, and the reduced Lagrangian and Hamiltonian equations are equivalent 
to the three-wave-interaction equations (|1.17|) . 

Proof. According to §2.31 we have 

L red = cKx \ TPo - cVi \tp„ , H rcd = cKx \ TPo + cV x |tp„ + cl x | T p n , <x rcd = »i |tp , 



where for convenience we introduced a trivial scaling by c = 1/ (47r 2 ). Inserting (|3.39|) into the formulas 
from Lemma 13.211 and exploiting Assumption 13.181 we obtain 



3 , 

cK x {A, A T ) = -di(A, A T ) = ^2 / iu n A n A nT dy + c.c, 

c¥ 1 (A)= (X^/ i-U n aj' n A n AZydy + v 3 J A 1 A 2 A 3 dy 



c.c, 



where we used asin(9 n = u> n u:' n and the properties of V and A, see Remark 13.11 Concerning S rcd we 
observe that the ansatz (|3.39j) can be written as 



X 
X T 



A 2 
A 3 
Ait 

Mt 



C.C. 



o — 



,101 e ±ip 2 e -i(0i+0 2 ) Q 











,101 _102 







with T : TP -> TP. The adjoint operator T' :TP-> TP a reads 



/ 6 



T' 



A 

A T 



T- 



-i<t>l 
i(0i+0 2 ) 



e 



V o 



\ 




e -ft>l 

1(01+02) J 



X 

X T 
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and wc find 



^ = cT' V 1 T =cT>(- 2 " Q 9 * 2i (o o)- 

Finally, the Lagrangian equations to L red are given by 

d T (&jU od (A, Ar)) - &jU cd (A 7 A T ) = 

and equal 

-2±flA T = %H rcd (A), 

which is the Hamiltonian equation to (H red , <r red ). Finally, both equations coincide with (JTTTTJ). □ 

As mentioned in the introduction, one can obtain the macroscopic equations (|1.17|) also by inserting 
the two-scale ansatz (|1.16l) into the Klein-Gordon chain (|3.2I) and requiring the coefficients of the terms 
£ 2 e i(w„t+e„j) £ Q vanish. Based on this formal expansion one can then justify the validity of (|1.17p , see 
|Gia06l lGla08] and §7.2 in |GHM06j . 

Remark 3.24. Inserting (|3.39p into the formulas from Lemma T3 . 2 1 1 and exploiting the resonance condi- 
tion, we obtain 

3 . 

cK \ Po = -|cIo|p = cV |p = icE |p = XX / \A n \ 2 dy. 

11=1 R 

These equalities reflect the cancelation in Lo and manifest the cquipartition of energy for plane-wave 
solutions. Moreover, the total energy Eo is the first integral associated to the invariance under phase 
shifts. 

Remark 3.25. Assumption ^. 18l which excludes all other possible resonances except for p\ +p2 +P3 = 0, 
can be weakened as follows. As already mentioned, on the hyperbolic scale we can ignore resonances of 
more than three pulses. We shall, however, exclude the possibility that further pulses are created via 
three-pulse resonance, because otherwise we expect the three-pulse solution that involves p\, P2, and p^ 
to be unstable on the hyperbolic scale. This gives rise to the non-resonance conditions 

2pi, 2p 2 , 2p 3 , p x - p 2 , Pl -P3,P2-P3 $V\ {±Pl, ±P2, ±Pa}- 

Assuming this, it can happen that there exist further degenerate three-pulse resonances between pi, 
P2, and p^, as for instance 2p\ -\- ps = or 2p\ — P2 = 0. In this case we still obtain a stable three- 
pulse solution, but the reduction procedure provides a different modulation equation. In fact, such 
degenerate resonances give rise to further cubic coupling terms in the formula for V red , as for instance 
^ J R (^-i + A^A^dy + c.c. or ^ J R AfA2<iy + ex., respectively. Altogether, in order to guarantee that 
(|1.17p is a reasonable macroscopic model it is sufficient to assume the resonance condition 

(1,0), (0, i), (l, i)ez 

and the non-resonance conditions 

(0, 2), (2, 0), (1,-1), (2, 1), (1, 2), (2, 2)£Z. 

3.6 Outlook to further examples 

Finally, we give a brief overview on two other classes of micro-macro transitions that can also be studied 
with respect to Hamiltonian and Lagrangian reductions. However, since these examples lead to additional 
problems, their investigation is left for a forthcoming study. 
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Coupled systems describe the interactions between modulated pulses and waves with long wave- 
length. The interesting feature here is that the corresponding two-scale ansatz 

x{t, r), 0) = e a X(et, ej) + e p A{et, ej)e +i{ut+e ^ + c.c. (3.40) 

combines contributions with different orders of magnitude. For instance, if we derive the effective macro- 
scopic model for a = and (3 = 1 by inserting (|3.40[) into the microscopic equation of motion, we 
find 

d TT X = c^dyyX, id T A= icgr d y A-p d y X A. (3.41) 

However, the asymmetric coupling between both equations prevents (|3.41[) from being the Eulcr-Lagrange 
equation of a suitable chosen macroscopic Lagrangian with variables X and A, and we conclude that the 
reduction of Lagrangian and Hamiltonian structures yields a different reduced model. 

Whitham's modulation theory is another example postponed to our forthcoming paper. This theory 
was originally developed in the context of PDEs, see [Whi74. Kam06|, but can also be applied to discrete 
systems, see for instance |HLM94i IF V99j . The main building block for Whitham's modulation theory 
are periodic travelling waves. These are exact solutions to ([1.13JI satisfying Xj(t) = X{uot + 9j) with 
(p = u>t + 6j. For the atomic chain the profile X must fulfil the following advance-delay differential 
equation 

to 2 X^) = $1 (x(<f> + 9)- X(0)) - $1 (x{<j>) - X{<j> - 0)) - & (X(4>)). 

In case that both (fr' Q and $' x are linear, we can solve this equation by means of Fourier transformation, 
and will recover plane waves with (|3.3p . but for nonlinear potentials more sophisticated methods are 
necessary, compare for instance |DHM06| and references therein. The basic ideas behind Whitham's 
modulation theory can be summarized as follows: We consider the KG chain and start with the following 
two-scale ansatz 

xj(t) = X(ej, et, e^Qiej, et)). 

Here, is the modulated phase and provides the fields of wave number and frequency via lu(t, y) = 
d T Q(r, y) and #(r, y) = d v Q{r, y), and for each (r, y) the function cf) ^ X(t, y, </>) is assumed to be a 
periodic travelling wave. Whitham's approach to the Lagrangian reduction allows to derive easily the 
corresponding macroscopic model. For the KG chain we find two nonlinear conservation laws 

d r 9{T,y)-d y u>(T,y) = 0, d T S(r, y) + d y g{r, y) = 0, (3.42) 

which are closed by the Gibbs equation dL = S du> + gd9 and the equation of state L = L(9, oj), which 
provides the action of a travelling wave as a function of uj and 9. Moreover, it can be shown that ([3~42]) 
is a system of Hamiltonian PDEs. 

The new feature appearing in this example is that the corresponding two-scale transformation depends 
on the modulated phase 0, which in turn depends on the solution to the macroscopic equation. In other 
words, within Whitham's modulation theory we do not know the two-scale transformations a priori and 
this complicates the reduction of Lagrangian and Hamiltonian structures. Finally the modulation theory 
for FPU chains leads to further complications, since the Galilean invariance of (|3.1|) causes a coupling 
between macroscopic waves and modulated oscillations, see [FV99[ IHer05l IGHM061 IDHR061 lD"H07j . 

References 

[AM78] R. Abraham and J.E. Marsden, Foundations of Mechanics, 2. cd., Perseus books, Cambridge 
Massachusetts, 1978, Updated 1985 Printing. 

[BG02a] A. Braides and M.S. Gelli, Continuum limits of discrete systems without convexity hypotheses, 
Math. Mech. Solids 7 (2002), 41-66. 



37 



[BG02b] , Limits of discrete systems with long-range interactions, J. Convex Anal. 9 (2002), 

363-399, Special issue on optimization (Montpellier, 2000). 

, From discrete systems to continuous variational problems: an introduction, Topics 



[BG06] _ 

on concentration phenomena and problems with multiple scales (A. Braides and V.C. Piat, 
eds.), Springer, 2006, pp. 3-78. 

[BLM06] T. Blesgen, S. Luckhaus, and L. Mugnai, Discrete free energy functionals for elastic materials, 
Analysis, Modeling and Simulation of Multiscale Problems (A. Mielke, ed.), Springer, 2006. 

[CJLL06] D. Cohen, T. Jalmke, K. Lorenz, and Ch. Lubich, Integrators for highly oscillatory Hamil- 
tonian systems, Analysis, Modeling and Simulation of Multiscale Problems (A. Mielke, cd.), 
Springer, 2006. 

[DafOO] CM. Dafcrmos, Hyperbolic conservation laws in continuum physics, Grundlchrcn d. mathem. 
Wissenschaften, vol. 325, Springer, Berlin, 2000. 

[DH07] W. Dreyer and M. Herrmann, Numerical experiments on the modulation theory for the non- 
linear atomic chain, Physica D (2007), in press, see doi : 10 . 1016/j . physd.2007.09.003. 

[DHM06] W. Dreyer, M. Herrmann, and A. Mielke, Micro-macro transition for the atomic chain via 
Whitham's modulation equation, Nonlincarity 19 (2006), no. 2, 471-500. 

[DHR06] W. Dreyer, M. Herrmann, and J. Rademacher, Pulses, traveling waves and modulational the- 
ory in oscillator chains, Analysis, Modeling and Simulation of Multiscale Problems (A. Mielke, 
ed.), Springer, 2006. 

[DM98] P. Dcift and T.-R. McLaughlin, A continuum limit of the Toda lattice, Mem. Americ. Math. 
Soc, vol. 131/624, American Mathematical Society, 1998. 

[E105] G.A. El, Resolution of a shock in hyperbolic systems modified by weak dispersion, Chaos 15 
(2005), 037103. 

[FJ00] G. Friesecke and R.D. James, A scheme for the passage from atomic to continuum theory for 
thin films, nanotubes and nanorods, J. Mech. Phys. Solids 48 (2000), no. 6-7, 1519-1540. 

[FP99] G. Friesecke and R.L. Pego, Solitary waves on FPU lattices. I. Qualitative properties, renor- 
malization and continuum limit, Comm. Math. Phys. 12 (1999), no. 6, 1601-1627. 

[FT02] G. Friesecke and F. Thcil, Validity and failure of the Cauchy-Born hypothesis in a two- 
dimensional mass-spring lattice, J. Nonlin. Sci. 12 (2002), 445-478. 

[FV99] A.-M. Filip and S. Venakides, Existence and modulation of traveling waves in particle chains, 
Comm. Pure Appl. Math. 51 (1999), no. 6, 693-735. 

[GHM06] J. Giannoulis, M. Herrmann, and A. Mielke, Continuum descriptions for the dynamics in 
discrete lattices: derivation and justification, Analysis, Modeling and Simulation of Multiscale 
Problems (A. Mielke, ed.), Springer, 2006. 

[Gia06] J. Giannoulis, Three-wave interaction in discrete lattices, PAMM 6 (2006), 475-476. 

[Gia08] , Interaction of modulated pulses in nonlinear lattices, in preparation, 2008. 

[GKM96] Zh. Ge, H.-P. Kruse, and J.E. Marsden, The limits of Hamiltonian structures in the three 
dimensional elasticity, shells, and rods, J. Nonlinear Sci. 6 (1996), no. 1, 19-57. 

[GKMS95] Zh. Ge, H.-P. Kruse, J.E. Marsden, and C. Scovel, The convergence of Hamiltonian structures 
in the shallow water approximation, Canad. Appl. Math. Quart. 3 (1995), no. 3, 277-302. 

[GM04] J. Giannoulis and A. Mielke, The nonlinear Schrddinger equation as a macroscopic limit for 
an oscillator chain with cubic nonlinearities, Nonlincarity 17 (2004), 551-565. 



38 



[GM06] 

[GMS07] 

[Her05] 

[HFM81] 

[HLM94] 

[HLW02] 

[Jar93] 

[Kam06] 

[KSM92] 

[Lax86] 
[Lax91] 

[LLV93] 

[Mie02] 

[Mie06] 

[Mie08] 

[NV05] 

[Sch06] 

[SU07] 

[SWOO] 

[Thc06] 
[TR99] 



, Dispersive evolution of pulses in oscillator chains with general interaction potentials, 

Discr. Cont. Dynam. Systems Ser. B 6 (2006), 493-523. 

J. Giannoulis, A. Mielke, and Ch. Sparber, Interaction of modulated pulses in the nonlinear 
Schrddinger equation with periodic potential, WIAS preprint 1221, 2007. 

M. Herrmann, Ein Mikro-Makro-Ubergang fur die nichtlineare atomare Kette mit Temperatur, 
Phd thesis, Humboldt-Universitat zu Berlin, 2005. 

B. L. Holian, H. Flaschka, and D.W. McLaughlin, Shock waves in the toda lattice: Analysis, 
Phys. Rev. A 24 (1981), no. 5, 2595-2623. 

M.H. Hays, CD. Levermore, and P.D. Miller, Macroscopic lattice dynamics, Physica D 79 
(1994), no. 1, 1-15. 

E. Hairer, Ch. Lubich, and G. Wanner, Geometric Numerical Integration, Springer Series in 
Comp. Mathem., vol. 31, Springer, Berlin, 2002. 

C. Jaryzynski, Multiple-time- scale approach to ergodic adiabatic systems: Another look, Phys. 
Rev. Lett. 71 (1993), no. 6, 839-842. 

A.M. Kamchatnov, Nonlinear periodic waves and their modulations, World Scientific, Singa- 
pore, 2006. 

P. Kirrmann, G. Schneider, and A. Mielke, The validity of modulation equations for extended 
systems with cubic nonlinearities, Proc. Roy. Soc. Edinburgh Sect. A 122 (1992), 85-91. 

P.D. Lax, On dispersive difference schemes, Physica D 18 (11986), 250-254. 

, The zero dispersion limit, a deterministic analogue of turbulence, Comm. Pure Appl. 

Math. 44 (1991), 1047-1056. 

P.D. Lax, CD. Levermore, and S. Venakides, The generation and propagation of oscillations 
in dispersive initial value problems and their limiting behavior, Important developments in 
soliton theory (A.S. Fokas and V.E. Zakharov, eds.), Springer, 1993, pp. 205-241. 

A. Mielke, The Ginzburg-Landau equation in its role as a modulation equation, Handbook of 
Dynamical Systems II (B. Fiedler, ed.), Elsevier Science B.V., 2002, pp. 759-834. 

, Macroscopic behavior of microscopic oscillations in harmonic lattices via Wigner- 



Husimi transforms, Arch. Ration. Mcch. Anal. 181 (2006), no. 3, 401-448. 

, Weak- convergence methods for Hamiltonian multiscale problems, Discr. Cont. Dynam. 



Systems Series A 20 (2008), no. 1, 53-79. 

A. I. Neishtadt and A. A. Vasiliev, Destruction of adiabatic invariance at resonances in slow- 
fast hamiltonian systems, preprint, 2005. 

B. Schmidt, A derivation of continuum nonlinear plate theory from atomistic models, Multi- 
scale Model. Simul. 5 (2006), no. 2, 664-694. 

G. Schneider and H. Uecker, The mathematics of light pulses in dispersive media, Jahresber. 
Deutsch. Math.-Verein. 109 (2007), no. 3, 139-161. 

G. Schneider and C.E. Wayne, Counter-propagating waves on fluid surfaces and the continuum 
limit for the Fermi- Pasta- Ulam model, International Conference on Differential Equations 
(K. Groger, B. Fiedler, and J. Sprckcls, eds.), World Scientific, 2000, pp. 390-403. 

F. Thcil, A proof of crystallization in two dimensions, Comm. Math. Phys. 262 (2006), no. 1, 
209-236. 

N.P. Trctiakov and J.N. Tcixcira Rabelo, Fast driving: Effective equations of motion for 
classical systems, Europhys. Lett. 48 (1999), no. 2, 143-149. 



39 



[Whi74] G.B. Whitham, Linear and Nonlinear Waves, Pure And Applied Mathematics, vol. 1237, 
Wiley Interscience, New York, 1974. 



40 



